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donment are also treated. The Markovian stochastic process representing 
the number of customers in the system is constructed in terms of rate- 
1 Poisson processes in two ways: (i) through random time changes and 
(ii) through random thinnings. Associated martingale representations are 
obtained for these constructions by applying, respectively: (i) optional stop- 
ping theorems where the random time changes are the stopping times and 
(ii) the integration theorem associated with random thinning of a counting 
process. Convergence to the diffusion process limit for the appropriate se- 
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1. Introduction 

The purpose of this paper is to illustrate how to do martingale proofs of 
many-server heavy-traffic limit theorems for queueing models, as in Krichagina 
and Puhalskii [37] and Puhalskii and Rciman [53J . Even though the method is re- 
markably effective, it is somewhat complicated. Thus it is helpful to see how the 
argument applies to simple models before considering more elaborate models. 
For the more elementary Markovian models considered here, these many-server 
heavy-traffic limits were originally established by other methods, but new meth- 
ods of proof have been developed. For the more elementary models, we show 
how key steps in the proof - a FCLT (([FJ])) and a FWLLN (Lemma l4~3)) - can be 
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done without martingales as well as with martingales. For the argument without 
martingales, we follow Mandelbaum and Pats [371135], but without their level 
of generality and without discussing strong approximations. 

Even though we focus on elementary Markov models here, we are motivated 
by the desire to treat more complicated models, such as the non-Markovian 
GI/GI/n models in Krichagina and Puhalskii [37], Puhalskii and Reiman [53] . 
Reed |55j and Kaspi and Ramanan [34] . and network generalizations of these, 
such as non-Markovian generalizations of the network models considered by 
Dai and Tezcan [H [T51 [IB] , Gurvich and Whitt [HIM [25] and references cited 
there. Thus we want to do more than achieve a quick proof for the simple models, 
which need not rely so much on martingales; we want to illustrate martingale 
methods that may prove useful for more complicated models. 

1.1. The Classical Markovian Infinite- Server Model 

We start by focusing on what is a candidate to be the easiest queueing model 
- the classic M/M/oo model - which has a Poisson arrival process (the first M), 
i.i.d. exponential service times (the second M), independent of the arrival pro- 
cess, and infinitely many servers. Let the arrival rate be A and let the individual 
mean service time be 

Afterwards, in $7.11 we treat the associated M/M/n/oo + M (Erlang A or 
Palm) model, which has n servers, unlimited waiting room, the first-come first- 
served (FCFS) service discipline and Markovian customer abandonment (the 
final +M); customers abandon (leave) if they have to spend too long waiting 
in queue. The successive customer times to abandon are assumed to be i.i.d. 
exponential random variables, independent of the history up to that time. That 
assumption is often reasonable with invisible queues, as in telephone call cen- 
ters. Limits for the associated M/M/n/oo (Erlang C) model (without customer 
abandonment) are obtained as a corollary, by simply letting the abandonment 
rate be zero. 

For the initial M/M/oo model, let Q(t) be the number of customers in the 
system at time i, which coincides with the number of busy servers at time t. It is 
well known that Q = {Q(t) : t > 0} is a birth-and-death stochastic process and 
that Q(t) =>■ Q{oo) as t — > oo, where =>- denotes convergence in distribution, 
provided that P(Q(0) < oo) = 1, and Q(oo) has a Poisson distribution with 
mean E[Q(oo)] — X/fi. 

We are interested in heavy-traffic limits in which the arrival rate is allowed 
to increase. Accordingly, we consider a sequence of models indexed by n and let 
the arrival rate in model n be 

A„ = n/i, n > 1 . (I) 

Let Q n (t) be the number of customers in the system at time t in model n. By the 
observation above, Q n (oo) has a Poisson distribution with mean E[Q n (oo)] = 
A„//i = n. Since the Poisson distribution approaches the normal distribution as 
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the mean increases, we know that 
Qn(oo) - n 



2V(0,1) as n->oo, (2) 



where N(0, 1) denotes a standard normal random variables (with mean and 
variance 1). 

However, we want to establish a limit for the entire stochastic process {Q n (t) : 
t > 0} as n — ► oo. For that purpose, we consider the scaled processes X n = 
{X n (t) :t>0} defined by 

X n (t)= Qn{t) ~ n , t>0. (3) 
y/n 

To establish the stochastic-process limit in ([3]) , we have to be careful about the 
initial conditions. We will assume that X n (0) X(0) as n — > oo. In addition, we 
assume that the random initial number of busy servers, Q n (0), is independent 
of the arrival process and the service times. Since the service-time distribution 
is exponential, the remaining service times of those customers initially in ser- 
vice have independent exponential distributions because of the lack-of-memory 
property of the exponential distribution. 

The heavy-traffic limit theorem asserts that the sequence of stochastic pro- 
cesses {X n : n > 1} converges in distribution in the function space D = 
-D([0, oo), R) to the Ornstein-Uhlenbeck (OU) diffusion process asm oo, pro- 
vided that appropriate initial conditions are in force; see Billingsley [8] and 
Whitt [69] for background on such stochastic-process limits. 

Here is the theorem for the basic M/M/oo model: 

Theorem 1.1 (heavy-traffic limit in D for the M/M/oo model) Consider the 
sequence of M/M/oo models defined above. If 

X n (0) =>■ X(0) in R as n -> oo , (4) 

then 

X n X in D as n — > oo , 

where X is the OU diffusion process with infinitesimal mean m(x) — —\ix and 
infinitesimal variance a (x) = 2/i. Alternatively, X satisfies the stochastic inte- 
gral equation 

X(t) = X(0) + y/2fiB(t) - n ( X{s)ds, t>0, (5) 

Jo 

where B = {B(t) : t > 0} is a standard Brownian motion. Equivalently, X 
satisfies the stochastic differential equation (SDE) 

dX(t) = -fj,X(t)dt + \J%dB(t), t > . (6) 
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Much of this paper is devoted to four proofs of Theorem 11.11 It is possible 
to base the proof on the martingale functional central limit theorem (FCLT), 
as given in §7.1 of Ethier and Kurtz [19] and here in Sj8] as we will show in 
^9.2i but it is not necessary to do so. Instead, it can be based on the classic 
FCLT for the Poisson process, which is an easy consequence of Donsker's FCLT 
for sums of i.i.d. random variables, and the continuous mapping theorem. Nev- 
ertheless, the martingale structure can still play an important role. With that 
approach, the martingale structure can be used to establish stochastic bounded- 
ness of the scaled queueing processes, which we show implies required fluid limits 
or functional weak laws of large numbers (FWLLN's) for random-time-change 
stochastic processes, needed for an application of the continuous mapping the- 
orem with the composition map. Alternatively (the third method of proof), the 
fluid limit can be established by combining the same continuous mapping with 
the strong law of large numbers (SLLN) for the Poisson process. 

It is also necessary to understand the characterization of the limiting diffu- 
sion process via ([5]) and ((6]). That is aided by the general theory of stochastic 
integration, which can be considered part of the martingale theory [19l [58] . 



1.2. The QED Many-Server Heavy-Traffic Limiting Regime 

We will also establish many-server heavy-traffic limits for Markovian models 
with finitely many servers, where the number n of servers goes to infinity along 
with the arrival rate in the limit. We will consider the sequence of models in the 
quality-and-efficiency (QED) many-server heavy-traffic limiting regime, which 
is defined by the condition 

— A„ 

P= > pa as n — ► oo . (7) 

V n 

This limit in which the arrival rate and number of servers increase together 
according to ((7j) is just the right way so that the probability of delay converges 
to a nondegenerate limit (strictly between and 1); see Halfin and Whitt [25] . 

We will also allow finite waiting rooms of size m n , where the waiting rooms 
grow at rate y/n as n — > oo, i.e., so that 



in 



K > as n — > oo . (8) 



With the spatial scaling by y/n, as in ([3]), this scaling in ([8]) is just right to 
produce a reflecting upper barrier at k in the limit process. 

In addition, we allow Markovian abandonment, with each waiting customer 
abandoning at rate 9. We let the individual service rate fj, and individual aban- 
donment rate 6 be fixed, independent of n. These modifications produce a se- 
quence of M/M/n/m n + M models (with +M indicating abandonment). The 
special case of the Erlang A (abandonment), B (blocking or loss) and C (delay) 
models are obtained, respectively, by (i) letting m n = oo, (ii) letting m n = 0, 
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in which case the +M plays no role, and (iii) letting m n = oo and 9 = 0. So all 
the basic many-server Markovian queueing models are covered. 

Here is the corresponding theorem for the M/M/n/m n + M model. 

Theorem 1.2 (heavy-traffic limit in D for the M/M/n/m n + M model) Con- 
sider the sequence of M/M/n/m n + M models defined above, with the scaling 
in ([7|) and (JSj) . Let X n be as defined in <j3j) . If X n (Q) => X(0) inM. as n — > oo ; 
then X n ^ X in D as n — > oo, where the limit X is the diffusion process with 
infinitesimal mean m(x) = —f3(i — \ix for x < and m(x) — —{3(1 — Ox for 
x > 0, infinitesimal variance o~ 2 (x) = 2/j, and reflecting upper barrier at k. Al- 
ternatively, the limit process X is the unique (— oo, n]-valued process satisfying 
the stochastic integral equation 



for t > 0, where B = {B(t) : t > 0} is a standard Brownian motion and U is 
the unique nondecreasing nonnegative process in D such that ([9]) holds and 



1.3. Literature Review 

A landmark in the application of martingales to queues is the book by 
Bremaud [12]; contributions over the previous decade are described there. As 
reflected by the references cited in Krichagina and Puhalskii [37] and Puhalskii 
and Reiman [53] , and as substantiated by Puhalskii in personal communica- 
tion, Liptser was instrumental in developing the martingale method to prove 
limit theorems for queueing models, leading to diffusion approximations; e.g., 
see Kogan, Liptser and Smorodinskii [36 and §10.4 of Liptser and Shiryaev [44] . 

The specific M/M/oo result in Theorem 11.11 was first established by Iglehart 
[29] : see Borovkov [2 OS], Whitt [68], Glynn and Whitt [22], Massey and Whitt 
[50] , Mandelbaum and Pats [ITJ [48] , Mandelbaum, Massey and Reiman [46] and 
Krichagina and Puhalskii [37] for further discussion and extensions. A closely 
related textbook treatment appears in Chapter 6 of Robert [S7J . Iglehart applied 
a different argument; in particular, he applied Stone's [62] theorem, which shows 
for birth-and-death processes that it suffices to have the infinitesimal means and 
infinitesimal variances converge, plus other regularity conditions; see Garnett et 
al. [5T] and Whitt [7T] for recent uses of that same technique. Iglehart directly 
assumed finitely many servers and let the number of servers increase rapidly with 
the arrival rate; the number of servers increases so rapidly that it is tantamount 
to having infinitely many servers. The case of infinitely many servers can be 
treated by a minor modification of the same argument. 

The corresponding QED finite-server result in Theorem 11.21 in which the 
arrival rate and number of servers increase together according to ([7J, which is 
just the right way so that the probability of delay converges to a nondegenerate 



X(t) = X(0) - 0ut + y^LB(t) - [ [u(X(s) A 0) + 6(X(s) V 0)] ds - U(t) (9) 




(10) 
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limit (strictly between and 1), was considered by Halfin and Whitt [26] for 
the case of infinite waiting room and no abandonment. There have been many 
subsequent results; e.g., see Mandelbaum and Pats [471HH], Srikant and Whitt 
[BTj . Mandelbaum et al. [46], Puhalskii and Reiman [53], Garnett et al. [21], 
Borst et al. p], Jelenkovic et al. [31], Whitt [72], Mandelbaum and Zeltyn [49] 
and Reed |55] for further discussion and extensions. 

Puhalskii and Reiman [53j apply the martingale argument to establish heavy- 
traffic limits in the QED regime. They establish many-server heavy-traffic limits 
for the GI/PH/n/oo model, having renewal arrival process (the GT), phase- 
type service-time distributions (the PH), n servers, unlimited waiting space, 
and the first-come first-served (FCFS) service discipline. They focus on the 
number of customers in each phase of service, which leads to convergence to a 
multi-dimensional diffusion process. One of our four proofs is essentially their 
argument. 

Whitt [72] applied the same martingale argument to treat the G/M/n/m n + 
M model, having a general stationary point process for an arrival process (the 
G), i.i.d. exponential service times, n servers, m n waiting spaces, the FCFS ser- 
vice discipline and customer abandonment with i.i.d exponential times to aban- 
don; see Theorem 3.1 in [72] . Whitt [72] is primarily devoted to a heavy-traffic 
limit for the G/H^/n/rrin model, having a special service-time distribution 
(a mixture of an exponential and a point mass at 0), by a different argument, 
but there is a short treatment of the G/M/n/m n + M model by the martingale 
argument, following Puhalskii and Reiman [53j quite closely. The martingale 
proof is briefly outlined in §5 there. The extension to general arrival processes 
is perhaps the most interesting contribution there, but that generalization can 
also be achieved in other ways. 

For our proof without martingales, we follow Mandelbaum and Pats [47]l48]. 
but without using strong approximations as they do. Subsequent related work 
has established similar asymptotic results for Markovian service networks with 
multiple stations, each with multiple servers, and possibly multiple customer 
classes; e.g., see Armony 1., Armony et al. [2]|3], Atar [4], Atar et al. [HE], Dai 
and Tezcan [HUH HE], Gurvich and Whitt EH [25] , Harrison and Zeevi [27], 
and Tezcan [63 . These complex stochastic networks have important applications 
to telephone call centers; see Gans et al. [2"0] . 

1. 4- Organization 

The rest of this paper is organized as follows: We start in fj2]by constructing 
the Markovian stochastic process Q representing the number of customers in 
the system in terms of rate-1 Poisson processes. We do this in two ways: (i) 
through random time changes and (ii) through random thinnings. We also give 
the construction in terms of arrival and service times used by Krichagina and 
Puhalskii 37 to treat the G/GI/oo model. 

Section <|3] is devoted to martingales. After reviewing basic martingale no- 
tions, we construct martingale representations associated with the different con- 
structions. We justify the first two representations by applying, respectively: (i) 
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optional stopping theorems where the random time changes are the stopping 
times and (ii) the integration theorem associated with random thinning of a 
counting process. The two resulting integral representations are very similar. 
These integral representations are summarized in Theorems 13.41 and 13.51 In §2] 
we also present two other martingale representations, one based on constructing 
counting processes associated with a continuous-time Markov chain, exploiting 
the infinitesimal generator, and the other - for G/GI/oo - based on the sequen- 
tial empirical process (see ([IS))). 

Section |4] is devoted to the main steps of the proof of Theorem 11.11 using the 
first two martingale representations. In §4.11 we show that the integral repre- 
sentation has a unique solution and constitutes a continuous function mapping 
K x D into D. In order to establish measurability, we establish continuity in 
the Skorohod [3U] J\ topology as well as the topology of uniform convergence 
over bounded intervals. As a consequence of the continuity, it suffices to prove 
FCLT's for the scaled martingales in these integral representations. For the 
first martingale representation, the scaled martingales themselves are random 
time changes of the scaled Poisson process. In §4.21 we show that a FCLT for 
the martingales based on this first representation, and thus the scaled queue- 
ing processes, can be obtained by applying the classical FCLT for the Poisson 
process and the continuous mapping theorem with the composition map. 

To carry out the continuous-mapping argument above, we also need to estab- 
lish the required fluid limit or functional weak law of large numbers (FWLLN) 
for the random time changes, needed in the application of the CMT with the 
composition map. In i )4.3[ following Mandelbaum and Pats [171 05], we snow 
that this fluid-limit step can be established by applying the strong law of large 
numbers (SLLN) for the Poisson process with the same continuous mapping de- 
termined by the integral representation. If we do not use martingales, then we 
observe that it is easy to extend the stochastic-process limit to general arrival 
processes satisfying a FCLT. 

But we can also use martingales to establish the FCLT and the FWLLN. 
For the FWLLN, the martingale structure can be exploited via the Lenglart- 
Rebolledo inequality to prove stochastic boundedness, first for the scaled mar- 
tingales and then for the scaled queueing processes, which in turn can be used to 
to establish the required FWLLN for the scaled random time changes - Lemma 

Since this martingale proof of the fluid limit relies on stochastic boundedness, 
which is related to tightness, we present background on these two important con- 
cepts in S}5] For the proof of Theorem we conclude there that it suffices to 
show that the predictable quadratic variations of the square-integrable martin- 
gales are stochastically bounded in R in order to have the sequence of scaled 
queueing processes {X n } be stochastically bounded in D. 

We complete the proof of Theorem 11.11 in §£] In Lemma 15.91 and £16.11 we 
show that stochastic boundedness of {X n } in D implies the desired fluid limit 
or FWLLN needed for the scaled random-time-change processes needed in an 
application of the continuous mapping with the composition function. In i j6.2l wc 
complete the proof by showing that the predictable quadratic variation processes 
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of the martingales are indeed stochastically bounded in R. In §6.31 we show 
that it is possible to remove a moment condition imposed on Q n (0), the initial 
random number of customers in the system, in the martingale representation; 
in particular, we show that it is not necessary to assume that E[Q n (0)] < oo. 
Finally, in §6.41 we state the G/GI /oo limit in Krichagina and Puhalskii [37] 
and show that the special case M/M /oo is consistent with Theorem ll.il 

In §7] we discuss stochastic-process limits for other queueing models. In ij7.ll 
we present a martingale proof of the corresponding many-server heavy-traffic 
limit for the M/M/n/ao + M model. Corresponding results hold for the model 
without customer abandonment by setting the abandonment rate to zero. The 
proof is nearly the same as for the M/M /oo model. The only significant differ- 
ence is the use of the optional stopping theorem for multiple stopping times with 
respect to multiparameter martingales, as in Kurtz [?D], §§2.8 and 6.2 of Ethicr 
and Kurtz [19] and §12 of Mandelbaum and Pats [48]. We discuss the extension 
to cover finite waiting rooms in §7.21 and non-Markovian arrival processes in 

We state a version of the martingale FCLT from p 339 of Ethier and Kurtz 
[H)] in §S] In a companion paper, Whitt [75], we review the proof, elaborating 
on the proof in Ethier and Kurtz [19] and presenting alternative arguments, 
primarily based on Jacod and Shiryayev |30] . We present a more extensive review 
of tightness there. In §13 we show how the martingale FCLT implies both the 
FCLT for a Poisson process and the required FCLT for the scaled martingales 
arising in the second and third martingale representations. 

2. Sample-Path Constructions 

We start by making direct sample-path constructions of the stochastic process 
Q representing the number of customers in the system in terms of independent 
Poisson processes. We show how this can be done in two different ways. After- 
wards, we present a different construction based on arrival and service times, 
which only exploits the fact that the service times are mutually independent 
and independent of the arrival process and the initial number of customers in 
the system (with appropriate common distribution assumptions). 

2.1. Random Time Change of Unit-Rate Poisson Processes 

We first represent arrivals and departures as random time changes of inde- 
pendent unit-rate Poisson processes. For that purpose, let A = {A(t) : t > 0} 
and S = {S(t) : t > 0} be two independent Poisson processes, each with rate 
(intensity) 1. We use the process A to generate arrivals and the process S to gen- 
erate service completions, and thus departures. Let the initial number of busy 
servers be Q(0). We assume that Q(0) is a proper random variable independent 
of the two Poisson processes A and S. 

The arrival process is simple. We obtain the originally-specified arrival process 
with rate A by simply scaling time in the rate-1 Poisson process A; i.e., we use 
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A\(t) = A(\t) for t > 0. It is elementary to see that the stochastic process A\ 
is a Poisson process with rate A. 

The treatment of service completions is more complicated. Let D(t) be the 
number of departures (service completions) in the interval [0,t]. We construct 
this process in terms of S by setting 

D(t) = S J q Q(s)ds^j , <>0, (11) 

but formula (fTTj) is more complicated than it looks. The complication is that Q(t) 
appearing as part of the argument inside S necessarily depends on the history 
{Q(s) : < s < £}, which in turn depends on the history of S, in particular, 
upon {S (/i f° Q{u) du) : < s < t}. Hence formula (IT]) is recursive; we must 
show that it is well defined. 

Of course, the idea is not so complicated: Formula (fTTj) is a consequence of 
the fact that the intensity of departures at time s is //Q(s), where the number 
Q(s) of busy servers at time s is multiplied by the individual service rate \i. The 
function /i J Q Q(s)ds appearing as an argument inside S serves as a random 
time change; see §11.6 of Bremaud [12], Chapter 6 of Ethier and Kurtz [19] and 
§7.4 of Daley and Vere- Jones [17]. 

By the simple conservation of flow, which expresses the content at time t as 
initial content plus flow in minus flow out, we have the basic expression 

Q{t) = Q(Q)+A{Xt)-D(t), t>0, 

= Q(0) + A(Xt)-s(fiJ^Q(s)dsj, t>0. (12) 

Lemma 2.1 (construction) The stochastic process {Q(t) : t > 0} is well defined 
as a random element of the function space D by formula (|12|) . Moreover, it is 
a birth- and- death stochastic process with constant birth rate = A and linear 
state- dependent death rate fj,k = k\x. 

Proof. To construct a bonafide random element of D, start by conditioning 
upon the random variable Q(0) and the two Poisson processes A and S. Then, 
with these sample paths specified, we recursively construct the sample path of 
the stochastic process Q = {Q(t) : t > 0}. By induction over the sucessive jump 
times of the process Q, we show that the sample paths of Q are right-continuous 
piecewise-constant real- valued functions of t. Since the Poisson processes A and 
S have only finitely many transitions in any finite interval w.p.l, the same is 
necessarily true of the constructed process Q. This sample-path construction 
follows Theorem 4.1 in Chapter 8 on p. 327 of Ethier and Kurtz [19]; the argu- 
ment also appears in Theorem 9.2 of Mandelbaum et al. [T5]. Finally, we can 
directly verify that the stochastic process Q satisfies the differential definition of 
a birth-and-death stochastic process. Let Tt represent the history of the system 
up to time t. That is the sigma field generated by {Q(s) : < s < t}. It is then 
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straightforward that the infinitesimal transition rates are as claimed: 



P(Q(t + h) - Q(t) = +l\Q(t) 
P(Q(t + h) - Q(t) = -l\Q(t) 
P(Q(t + h) - Q(t) = 0\Q(t) 



k,T t ) 



Xh + o(h) , 

kfih + o{h) , 

1 — Xh — kfj,h + o(h) , 



as h I for each k > 0, where o(h) denotes a function / such that f(h) jh — > as 
h | 0. Ethier and Kurtz [19] approach this last distributional step by verifying 
that the uniquely constructed process is the solution of the local-martingale 
problem for the generator of the Markov process, which in our case is the birth- 
and-death process. ■ 

Some further explanation is perhaps helpful. Our construction above is con- 
sistent with the construction of the queue-length process as a Markov process, 
specifically, a birth-and-death process. There are other possible constructions. 
A different construction would be the standard approach in discrete-event simu- 
lation, with event clocks. Upon the arrival of each customer, we might schedule 
the arrival time of the next customer, by generating an exponential interarrival 
time. We might also generate the required service time of the current arrival. 
With the simulation approach, we have information about the future state that 
we do not have with the Markov-process construction. The critical distinction 
between the different constructions involves the information available at each 
time. The information available is captured by the filtration, which we discuss 
with the martingale representations in the next section. 

The random-time-change approach we have used here is natural when apply- 
ing strong approximations, as was done by Mandelbaum, Massey and Reiman 
[15] and Mandelbaum and Pats [37] 05] ■ They applied the strong approximation 
for a Poisson process, as did Kurtz [39j . A different use of strong approxima- 
tions to establish heavy-traffic limits for non-Markovian infinite-server models 
is contained in Glynn and Whitt [22] , 

2.2. Random Thinning of Poisson Processes 

We now present an alternative construction, which follows §11.5 of Bremaud 
[T2] and Puhalskii and Reiman [53]. For this construction, let A\ and S^k for k > 
1 be independent Poisson processes with rate A and fi, respectively. As before, 
we assume that these Poisson processes arc independent of the initial number of 
busy servers, Q(0). This will not alter the overall system behavior, because the 
service-time distribution is exponential. By the lack-of-memory property, the 
remaining service times are distributed as i.i.d. exponential random variables, 
independent of the elapsed service times at time 0. 

We let all arrivals be generated from arrival process A\; we let service com- 
pletions from individual servers be generated from the processes S^k- To be 
specific, let the servers be numbered. With that numbering, at any time we use 
Sfi,k to generate service completions from the busy server with the fc th smallest 
index among all busy servers. (We do not fix attention on a particular server, 
because we want the initial indices to refer to the busy servers at all time.) 
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Instead of (fTT|) . we define the departure process by 



t 



D (t) = X] / l {Q( s -)> k } dS v,k( s ), t > , 



(13) 



where 1a is the indicator function of the event A; i.e., 1a (w) = 1 of S ^4 and 
1a(w) = otherwise. It is important that we use the left limit Q(s-) in the 
integrand of (|13|> , so that the intensity of any service completion does not depend 
on that service completion itself; i.e., the functions Q(s-) and l{Q( s _) = fc} are 
left-continuous in s for each sample point. Then, instead of (|12p . we have 



Q(t) = Q(0) + A(Xt)-J2 / l{Q(0>fe} <^, fc (s), *>0- ( 14 ) 



With this alternative construction, there is an analog of Lemma 12.11 proved 
in essentially the same way. In this setting, it is even more evident that we 
can construct a sample path of the stochastic process {Q{t) : t > 0} by first 
conditioning on a realization of Q(Q), {A\(t) : t > 0} and {S^k(t) : t > 0}, 



Even though this construction is different that the one in ^2.11 it too is 
consistent with the Markov-process view. Consistent with most applications, we 
know what has happened up to time t, but not future arrival times and service 
times. 

2.3. Construction from Arrival and Service Times 

Now, following Krichagina and Puhalskii [37] , we construct the sample paths 
of the process Q from the arrival and service times. This construction applies 
to the more general G/GI/oo system; we will show how the approach applies 
to the special case M/M/oo; see § §3.7l and l6.4l See [37] for references to related 
work. 

Let A(t) be the cumulative number of arrivals in the interval [0, t] and let Ti 
be the time of the i th arrival. Let all the service times be mutually independent 
random variables, independent of the arrival process and the number, Q(0), of 
customers in the system at time (before new arrivals). Let Q(0) be independent 
of the arrival process. Let the Q(0) initial customers have service times from 
{fji,i > 1} with cumulative distribution function (cdf) Fq. Let the new arrivals 
have service times from {r/i, i > 1} with cdf F. 

Then D{t), the number of customers that leave the system by time t, can be 
expressed as 




k> 1. 



Q(0) A(t) 



D(t) 



]T i(m <t) + J2 1 ( Ti + *h<t), t > o • 



(15) 



i=l i=l 



and 



Q(t) 



0(0) A(t) 

>t) + J2 ^ +»*>*)> 1 > ° • 



(16) 
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Since this construction does not require A to be Poisson (or even renewal) 
or the cdf's F and Fo to be exponential, this approach applies to the non- 
Markovian G/GI /oo model. However, the stochastic process Q itself is no longer 
Markov in the general setting. With Poisson arrivals, we can extend [37] to 
obtain a Markov process by considering the two-parameter process {Q(t 7 y) : 
< y < t,t > 0}, where Q(t,y) is the number of customers in the system at 
time t that have elapsed service times greater than or equal to y. (For simplicity 
in treating the initial conditions, let the initial customers have elapsed service 
times at time 0. Since P(A(0) = 0) = 1, Q(t, t) is (w.p.l) the number of initial 
customers still in the system at time t.) Then 

Q(0) A(t-y) 

Q(t,y) = ^l(77 4 >i)+ Hn + Th>t),0<y<t,t>0. (17) 

i=l i=l 

With renewal (GI) arrivals (and the extra assumption that P(A(0) = 0) = 1), 
we can obtain a Markov process by also appending the elapsed interarrival time. 
Of course, there is an alternative to (|17p if we add remaining times instead of 
elapsed times, but that information is less likely to be available as time evolves. 
Heavy-traffic limits for these two-parameter processes follow from the argument 
of [37], but we leave detailed discussion of these extensions to future work. For 
other recent constructions and limits in this spirit, see Kaspi and Ramanan [34] 
and references cited there. 

3. Martingale Representations 

For each of the sample-path constructions in the previous section, we have as- 
sociated martingale representations. At a high level, it is easy to summarize how 
to treat the first two sample-path constructions, drawing only on the first two 
chapters of Bremaud [12] . We represent the random time changes as stopping 
times with respect to appropriate nitrations and apply versions of the optional 
stopping theorem, which states that random time changes of martingales are 
again martingales, under appropriate regularity conditions; e.g., see Theorem 
T2 on p. 7 of [12] . The problems we consider tend to produce multiple random 
time changes, making it desirable to apply the optional stopping theorem for 
multiparameter random time changes, as in §§2.8 and 6.2 of Ethier and Kurtz 
[19] , but we postpone discussion of that more involved approach until Section 
O 

For the random thinning, we instead apply the integration theorem for mar- 
tingales associated with counting processes, as in Theorem T8 on p. 27 of 
Bremaud [12] , which concludes that integrals of predictable processes with re- 
spect to martingales associated with counting processes produce new martin- 
gales, under regularity conditions. 

We also present a third martingale representation, based on martingales asso- 
ciated with generators of Markov processes. That approach applies nicely here, 
because the queueing processes we consider are birth-and-death processes. Thus 
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we can also apply martingales associated with Markov chains, as on pp 5-6, 294 
of 12 . Finally, we present a fourth martingale representation associated with 



3.1. Martingale Basics 

In this subsection we present some preliminary material on continuous-time 
martingales. There is a large body of literature providing background, including 
the books by: Bremaud [12], Ethier and Kurtz [19], Liptser and Shiryayev [44], 
Jacod and Shiryayev [30] , Rogers and Williams [58] [59] , Karatzas and Shrcve 
[33] and Kallenberg [35]. The early book by Bremaud 12J remains especially 
useful because of its focus on stochastic point processes and queueing models. 
More recent lecture notes by Kurtz [UJ and van der Vaart [53] are very helpful 
as well. 

Stochastic integrals play a prominent role, but we only need relatively sim- 
ple cases. Since we will be considering martingales associated with counting- 
processes, we rely on the elementary theory for hnite-variation processes, as 
in §IV.3 of Rogers and Williams [58 . In that setting, the stochastic integrals 
reduce to ordinary Stieltjes integrals and we exploit integration by parts. 

On the other hand, as can be seen from Theorems 11.11 and 11.2] the limiting 
stochastic processes involve stochastic integrals with respect to Brownian mo- 
tion, which is substantially more complicated. However, that still leaves us well 
within the classical theory. We can apply the Ito calculus as in Chapter IV of 
[58] , without having to draw upon the advanced theory in Chapter VI. 

For the stochastic-process limits in the martingale setting, it is natural to turn 
to Ethier and Kurtz [19] . Liptser and Shiryayev [44] and Jacod and Shiryayev 
[30] . We primarily rely on Theorem 7.1 on p. 339 of Ethier and Kurtz [19] . 
The Shiryayev books are thorough; Liptser and Shiryayev [U] focuses on basic 
martingale theory, while Jacod and Shiryayev [30 focuses on stochastic-process 
limits. 

We will start by imposing regularity conditions. We assume that all stochastic 
processes X = {X(t) : t > 0} under consideration are measurable maps from an 
underlying probability space (fi, J 7 , P) to the function space D = D([0, oo),M) 
endowed with the standard Skorohod J\ topology and the associated Borel a- 
field (generated by the open subsets), which coincides with the usual <7-field 
generated by the coordinate projection maps; see §§3.3 and 11.5 of [69] . 

Since we will be working with martingales, a prominent role is played by 
the filtration (histories, family of a fields) F = {Tt ■ t > 0} defined on the 
underlying probability space (f2, !F, P). (We have the containments: JF fl C Tt 2 C 
T for all t\ < *2.) As is customary, see p. 1 of [44], we assume that all nitrations 
satisfy the usual conditions: 

(i) they are right-continuous: 




for all < t < oo , and 



u:u>t 
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(ii) complete (To, and thus Tt contains all P-null sets of T). 

We will assume that any stochastic process X under consideration is adapted 
to the filtration; i.e., X is F-adapted, which means that X(t) is T -measurable 
for each 4. These regularity conditions guarantee desirable measurability proper- 
ties, such as progressive measurability: The stochastic process X is progressively 
measurable with respect to the filtration F if, for each 4 > and Borel measur- 
able subset A of R, the set {(s,u>) : < s < t,w G ft, X(s,u) e A} belongs to 
the product a field #([0,4]) x Tt, where #([0,4]) is the usual Borel a field on 
the interval [0,4]. See p. 5 of [33] and Section 1.1 of [44]. In turn, progressive 
measurability implies measurability, regarding X(t) as a map from the product 
space ft x [0, oo) to R. 

A stochastic process M = {M(t) : 4 > 0} is a a martingale (submartingale) 
with respect to a filtration F = {Tt : 4 > 0} if M(4) is adapted to Tt and 
E[M(t)] < oo for each 4 > 0, and 

E[M{t + s)\T t ] = (>)M(4) 

with probability 1 (w.p.l) with respect to the underlying probability measure 
P for each 4 > and s > 0. 

It is often important to have a stronger property than the finite-moment 
condition E[M(t)] < oo. A stochastic process X = {X(t) : 4 > 0} is uniformly 
integrable (UI) if 

lim sup{£[|X(4)|l { | X ( t )|>n}]}=0; 

see p. 286 of [12] and p. 114 of [59]. We remark that UI implies, but is not 
implied by 

sup{E[\X(t)\]} < oo . (18) 
t>o 

A word of warning is appropriate, because the books are not consistent in their 
assumptions about integr ability. A stochastic process X = {X(t) : 4 > 0} may 
be called integrable if £'[|A(4)|] < oo for all 4 or if the stronger |[T8]) holds. 
This variation occurs with square-integrable, defined below. Similarly, the basic 
objects may be taken to be martingales, as defined as above, or might instead 
be UI martingales, as on p. 20 of [44] , 

The stronger UI property is used in preservation theorems - theorems im- 
plying that stopped martingales and stochastic integrals with respect to mar- 
tingales - remain martingales. In order to get this property when it is not at 
first present, the technique of localizing is applied. We localize by introducing 
associated stopped processes, where the stopping is done with stopping times. 
A nonnegative random variable r is an F-stopping time if stopping sometime 
before 4 depends only on the history up to time 4, i.e., if 

{t < 4} G T t for all 4 > . 

For any class C of stochastic processes, we define the associated local class 
Cioc as the class of stochastic processes {A (4) : 4 > 0} for which there exists a 
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sequence of stopping times {r„ : n > 1} such that r„ — > oo w.p.l asn-> oo and 
the associated stopped processes {X(r n At) : t > 0} belong to class C for each 
n, where a Ab = min{a, b}. We obtain the class of local martingales when 
C is the class of martingales. Localizing expands the scope, because if we start 
with martingales, then such stopped martingales remain martingales, so that all 
martingales are local martingales. Localizing is important, because the stopped 
processes not only are martingales but can be taken to be UI martingales; see 
p. 21 of [Hj and §IV.12 of [55]. The UI property is needed for the preservation 
theorems. 

As is customary, we will also be exploiting predictable stochastic processes, 
which we will take to mean having left-continuous sample paths. See p. 8 of [12] 
and §1.2 of [30] for the more general definition and additional discussion. The 
general idea is that a stochastic process X = {X(t) : t > 0} is predictable if 
its value at t is determined by values at times prior to t. But we can obtain 
simplification by working with stochastic processes with sample paths in D. In 
the setting of D, we can have a left-continuous process by either (i) considering 
the left-limit version {X(t— ) : t > 0} of a stochastic process X in D (with 
X(0— ) = X(0)) or (ii) considering a stochastic process in D with continuous 
sample paths. Once we restrict attention to stochastic processes with sample 
paths in D, we do not need the more general notion, because the left-continuous 
version is always well defined. If we allowed more general sample paths, that 
would not be the case. 

We will be interested in martingales associated with counting processes, 
adapted to the appropriate filtration. These counting processes will be nonneg- 
ative submartingale processes. Thus we will be applying the following special 
case of the Doob-Meyer decomposition. 

Theorem 3.1 (Doob-Meyer decomposition for nonnegative submartingales) If 
Y is a submartingale with nonnegative sample paths, E[Y(t)] < oo for each t, 
and Y is adapted to a filtration F = {Ft}, then there exists an F-predictable 
process A, called the compensator ofY or the dual-predictable projection, such 
that A has nonnegative nondecreasing sample paths, E[A(t)] < oo for each t, 
and M = Y — A is an F-martingale. The compensator is unique in the sense 
that the sample paths of any two versions must be equal w.p.l. 

Proof. See §1.4 of Karatzas and Shreve [33]. The DL condition in [33 is 
satisfied because of the assumed nonnegativity; see Definition 4.8 and Problem 
4.9 on p. 24. For a full account, see §VI.6 of [58]. ■ 

3.2. Quadratic Variation and Covariation Processes 

A central role in the martingale approach to stochastic-process limits is played 
by the quadratic-variation and quadratic-covariation processes, as can be seen 
from the martingale FCLT stated here in $8] That in turn depends on the notion 
of square- integrability. We say that a stochastic process X = {X(i) : t > 0} 
is square integrable if E[X(t) 2 ] < oo for each t > 0. We thus say that a 
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martingale M = {M(t) : t > 0} (with respect to some filtration) is square 
integrable if E[M (t) 2 ] < oo for each t > 0. Again, to expand the scope, we 
can localize, and focus on the class of locally square integrable martingales. 
Because we can localize to get the square-integrability, the condition is not very 
restrictive. 

If M is a square-integrable martingale, then M 2 = {M(t) 2 : t > 0} is nec- 
essarily a submartingale with nonnegative sample paths, and thus satisfies the 
conditions of Theorem 13.11 The predictable quadratic variation (PQV) of 
a square-integrable martingale M, denoted by (M) = {(M)(t) : t > 0} the 
(angle-bracket process), is the compensator of the submartingale M 2 ; i.e., 
the stochastic process (M) is the unique nondecreasing nonnegative predictable 
process such that E[(M)(t)] < oo for each t and M 2 — (M) is a martingale with 
respect to the reference filtration. (Again, uniqueness holds to the extent that 
any two versions have the same sample paths w.p.l.) Not only does square inte- 
grability extend by localizing, but Theorem [3J] has a local version; see p. 375 of 
[58] , As a consequence the PQV is well defined for any locally square-integrable 
martingale. 

Given two locally square-integrable martingales Mi and M 2 , the predictable 
quadratic covariation can be defined as 

(Mi, M 2 ) = J «Mx + M a ) - {M x - M 2 )) ; 

see p. 48 of [44]. It can be characterized as the unique (up to equality of 
sample paths w.p.l) nondecreasing nonnegative predictable process such that 
E[{M 1 ,M 2 )(t)\ < 00 for each t and M X M 2 - (Mi,M 2 ) is a martingale. 

We will also be interested in another quadratic variation of a square-integrable 
martingale M, the so-called optional quadratic variation [M] (OQV), the 
square-bracket process). The square-bracket process is actually more gen- 
eral than the angle-bracket process, because the square-bracket process is well 
defined for any local martingale, as opposed to only all locally square-integrable 
martingales; see §§IV. 18, 26 and VI. 36, 37 of [S5j. The following is Theorem 
37.8 on p. 389 of [S5j. For a stochastic process M, let AM(t) = M(t) - M(i-), 
the jump at t, for t > 0. 

Theorem 3.2 (optional quadratic covariation for local martingales) Let Mx 
and M 2 be local martingales with Mi(0) — A/ 2 (0) = 0. Then there exists a 
unique process, denoted by [M!,M 2 ], with sample paths of finite variation over 
bounded intervals and [Mi,M 2 ](0) = such that 

(i)MiM 2 — [Mi,M 2 ] is a local martingale 
(m)A[M 15 M 2 ] = (AMx)(AM 2 ) . 

For one local martingale M, the optional quadratic variation is then defined as 

[M] = [M, M]. 

Note that, for a locally square-integrable martingale M, both M 2 — (M) and 
M 2 — [M] are local martingales, but (M) is predictable while [M] is not. Indeed, 
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subtracting, we see that [M] — (M) is a local martingale, so that (M) is the 
compensator of both M 2 and [M] . 

There is also an alternative definition of the OQV; see Theorem 5.57 in §5.8 
of [64]. 

Theorem 3.3 (alternative definition) If M\ and M2 are local martingales with 
Mi(0) = M 2 (0) = 0, then 

OO 

[M 1 ,M 2 ](t)= lim V(Mi(t„, i )-M 1 (t„ ii _ 1 ))(M 2 (t n ,i)-M 2 (t„ ii _i)) , 

n — >oo * — » 

i=l 

where t n ^ = t A (i2~ n ) and the mode of convergence for the limit as n — > 00 
is understood to be in probability. The limit is independent of the way that the 
time points t n> i are selected within the interval [0,t], provided that t n ^ > t n ^\ 
and that the maximum difference t n ^ — for points inside the interval [0, t] 

goes to as n —>■ 00. 

Unfortunately, these two quadratic-variation processes (M) and [M] asso- 
ciated with a locally square-integrable martingale M, and the more general 
covariation processes (Mi,M2) and [M\,M2], are somewhat elusive, since the 
definitions are indirect; it remains to exhibit these processes. We will exploit 
our sample-path constructions in terms of Poisson processes above to identify 
appropriate quadratic variation and covariation processes in following subsec- 
tions. 

Fortunately, however, the story about structure is relatively simple in the 
two cases of interest to us: (i) when the martingale is a compensated counting 
process, and (ii) when the martingale has continuous sample paths. The story 
in the second case is easy to tell: When M is continuous, (M) = [M], and this 
(predictable and optional) quadratic variation process itself is continuous; see 
§VI.34 of |58J . This case applies to Brownian motion and our limit processes. 
For standard Brownian motion, (M)(t) = [M]{t) =t,t>0. 

3.3. Counting Processes 

The martingales we consider for our pre-limit processes will be compensated 
counting processes. By a counting process (or point process), we mean a stochas- 
tic process N = {N(t) : t > 0} with nondecreasing nonnegative-integer-valued 
sample paths in D and N(0) = 0. We say that N is a unit-jump counting process 
if all jumps are of size 1. We say that N is non-explosive if N(t) < 00 w.p.l for 
each t < 00. Equivalently, if {T„ : n > 1} is the associated sequence of points, 
where 

N(t) = max {n > : T n < t}, t>0, 

with To = 0, then N is a unit-jump counting process if T n+ \ > T n for all n > 0; 
while N is non-explosive if T n — > 00 w.p.l as n — > 00; see p. 18 of Bremaud 
(1981). 
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As discussed by Bremaud [12] . the compensator of a non-explosive unit-jump 
counting process is typically (under regularity conditions!) a stochastic process 
with sample paths that are absolutely continuous with respect to Lebesgue 
measure, so that the compensator A can represented as an integral 

A(t) = [ X(s)ds, t > , 
Jo 

where X = {X(t) : t > 0} is adapted to the filtration F. When the compensator 
has such an integral representation, the integrand X is called the stochastic 
intensity of the counting process N. 

We will apply the following extension of Theorem 13.11 

Lemma 3.1 (PQV for unit-jump counting processes) If N is a non-explosive 
unit-jump counting process adapted to F with E[N(t)\ < oo for all t, and if the 
compensator A of N provided by Theorem \3.l\ is continuous, then the martingale 
M = N — A is a square-integrable martingale with respect to F with quadratic 
variation processes: 

(M) = A and [M] = N . 

We first observe that the conditions of Lemma [3TT1 implv that N is a nonnegative 
F-submartingale, so that we can apply Theorem l3.ll We use the extra conditions 
to get more. We prove Lemma |3. II in the Appendix. 

3-4- First Martingale Representation 

We start with the first sample-path construction in £12.11 As a regularity 
condition, here we assume that E[Q(0)] < oo. We will show how to remove 
that condition later in §6.31 It could also be removed immediately if we chose 
to localize. 

Here is a quick summary of how martingales enter the picture: 

The Poisson processes A(t) and S(t) underlying the first representation of the 

queueing model in §2. II as well as the new processes A(Xt) and S (^fi J * Q(s) ds^j 
there have nondecreasing nonnegative sample paths. Consequently, they are 
submartingales with respect to appropriate nitrations (histories, families of a- 
fields). Thus, by subtracting the compensators, we obtain martingales. Then 
the martingale M so constructed turns out to be square integrable, admitting 
a martingale representation M 2 — (M) , where (M) is the predictable quadratic 
variation, which in our context will coincide with the compensator. 

In constructing this representation, we want to be careful about the filtration 
(histories, family of a fields). Here we will want to use the filtration F = \Tt ■ 
t > 0} defined by 

T t = o f<3(0), A(As), S (n Q{u) du^j :0<s<?j , t > , (19) 



augmented by including all null sets. 
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The following processes will be proved to be F-martingales: 
Afi(t) = A(Xt) - At, 

M 2 (t) = sUj^Q(s)d S S \- f i^Q(s)ds, t>0, (20) 

where here A refers to the arrival process. Hence, instead of ([12"]) , we have the 
alternate martingale representation 

Q(t)=Q(0) + Mi(t)- M 2 {t)+Xt- n f Q(s)ds , t>0. (21) 

Jo 

In applications of Theorem 13.11 and Lemma 13.11 it remains to show that the 
conditions are satisfied and to identify the compensator. The following lemma 
fills in that step for a random time change of a rate-1 Poisson process by ap- 
plying the optional stopping theorem. At this step, it is natural, as in §12 of 
Mandelbaum and Pats [48] . to apply the optional stopping theorem for mar- 
tingales indexed by directed sets (Theorem 2.8.7 on p. 87 of Ethier and Kurtz 
P35]) associated with multiparameter random time changes (§6.2 on p. 311 of 
|19j). but here we can use a more elementary approach. For supporting theory 
at this point, see Theorem 17.24 and Proposition 7.9 of Kallenberg [35] and §7.4 
of Daley and Vere- Jones [17]. We use the mutiparameter random time change 
in §0 

Let o be the composition map applied to functions, i.e., (x o y)(t) = x{y{t)). 

Lemma 3.2 (random time change of a rate-1 Poisson process) Suppose that S 
is a rate-1 Poisson process adapted to a filtration F = {J- t : t > 0} and I = 
{I(t) : t > 0} is a stochastic process with continuous nondecreasing nonnegative 
sample paths, where I(t) is an F-stopping time for each t > 0. In addition, 
suppose that the following moment conditions hold: 

E[I(t)} < oo and E[S(I(t))} < oo for all t > 0. (22) 

Then S o I = {S(I(t)) : t > 0} is a non-explosive unit-jump counting process 
such that M = Sol — I = {S(I(t)) — I{t) : t > 0} is a square-integrable 
martingale with respect to the filtration Fi = {J~i(t) ■ t > 0}, having quadratic 
variation processes 

(M)(t)=I(t) and [M]{t) = S(I(t)), t>0. (23) 

Proof. Since the sample paths of I are continuous, it is evident that So / is a 
unit-jump counting process. By condition (|22p . it is non-explosive. In order to 
apply the optional stopping theorem, we now localize by letting 

I m (t) = I(t) Am, t>0. 

Since I(t) is an F-stopping time, I m (t) is a bounded F-stopping time for each 
m > 1. The optional stopping theorem then implies that M m = S o I m — I rn = 
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{S(I m (t)) - I m {t) : t > 0} is an Fj-martingale; e.g., see p. 7 of pj or p. 
61 of [TH]. As a consequence I m is the compensator of S o I rn . Since we have 
the moment conditions in (|22|) , we can let to "f oo and apply the monotone 
convergence theorem with conditioning, as on p. 280 of Bremaud, to deduce 
that M = Sol — I = {S(I(t)) — I(t) : t > 0} is a martingale. Specifically, given 
that 

E[S(I m (t + a)) - I m (t + a)\F I(s) ] = S(I m (s)) - I m {s) w.p.l 
for all to, 

E[S{I m {t + s))\T I(s) ]^ E[S{I{t + s))\T I(s) ] w.p.l as to -> oo , 

^[/ m (i + s)|^> (s) ] -> £?[/(*+ s)|J> (s) ] w.p.l as m-^oo, 

S'(I Tn (s))) t ^(/(s)) and I m (s) | 7(s) as to -> oo, we have 

E[S(I(t + a)) - I(t + s)|J> w ] = S(J(s)) - 1(a) w.p.l 

Lemma [3TT] implies the square-integrability and identifies the quadratic variation 
processes. ■ 

To apply Lcmma l3.2l to our M/M /oo queueing problem, we need to verify the 
finite- moment conditions in (|22[) . For that purpose, we use a crude inequality: 

Lemma 3.3 (crude inequality) Given the representation (|12|) , 

Q(t) < Q(0) + A(Xt), t>0, (24) 

so that 

[ Q(s)ds<t(Q(0)+A(M)), t>0 . (25) 
Jo 

Now we want to show that our processes in (|20p actually are martingales with 
respect to the filtration in (|19p . To do so, we will apply Lemma \'S. 21 However, 
to apply Lemma 13.21 we will first alter the filtration. In order to focus on the 
service completions in the easiest way, we initially condition on the entire arrival 
process, and consider the filtration F 1 = {JFj 1 : t > 0} defined by 

T\ = a(Q(0),{A(u) : u > 0},S{s) : < s < t) , t >0, (26) 

augmented by including all null sets. Then, as in the statement of Lemma 13.21 
we consider the associated filtration F} = {^im '■ t > 0}. Finally, we are able 
to obtain the desired martingale result with respect to the desired filtration F 
in P]). 

Lemma 3.4 (verifying the conditions.) Suppose that E[Q(0)] < oo in the set- 
ting of 30 with {Q(t) : t > 0} defined in ifl^l. 

I(t)=fi f Q(s)ds, t>0, (27) 
Jo 

and the filtration being F 1 in (|26p . Then the conditions of Lemma YS .21 are satis- 
fied, so that So I — I is a sguare-integrable Fj-martingale with F] -compensator 
I in (|27p . As a consequence, S o I — I is also a square-integrable F -martingale 
with F-compensator I in (|2T|) for filtration F in (|19j) . 



G. Pang, R. Talreja and W. Whitt/ Martingale Proofs 



214 



Proof. First, we can apply the crude inequality in (|25| to establish the re- 
quired moment conditions: Since E[Q(0)] < oo, 



E 
and 



H I Q(s)ds 
o 



< nt{E[Q(0)} + E[A(Xt)]) = ntE[Q(0)} + fiXt 2 < oo, t > , 



S ( fi j Q{s) ds) <S (fit(Q(0) + A{Xt)) , t > 



so that 



E 



S[n I Q{s)ds 



< E[S(»t(Q(0) + A(Xt)))], 

= E[E[S(iJ,t(Q(p)+A(Xt)))\Q(0)+A(Xt)]}, t>0, 
= fit (E[Q(0)] + At) < oo, t >0. 



Then, by virtue of (fT2"| and the recursive construction in Lemma |2~T1 for each 
t > 0, I(t) in (|2T)l is a stopping time relative to T). for all x > 0, i.e., 

{/(*) < x} e T\ for all x > and i > . 

(This step is a bit tricky: To know I(t), we need to know Q(s),0 < s < t, 
but, by (2.2), that depends on I(s),Q < s < t. Hence, to know whether or not 
{I(t) < x} holds, it suffices to know S(u) : < u < x.) Since {S(t) — t : t > 
0} is a martingale with respect to F 1 and the moment conditions in ([22|) are 
satisfied, we can apply Lemma T3.2I to deduce that {S(I(t)) — I(t) : t > 0} is a 
square-integrable martingale with respect to the filtration F} = \JF}i t \ '■ t > 0} 
augmented by including all null sets and that / in (|27|) is both the compensator 
and the predictable quadratic variation. Finally, since the process representing 
the arrivals after time t, i.e., the stochastic process {A(t + s) — A(t) : s > 0}, 
is independent of Q(s), < s < t, by virtue of the recursive construction in 
Lemma 12.11 (and the assumption that A is a Poisson process), we can replace 
the filtration F} by the smaller filtration F in (fT9|) . That completes the proof. ■ 
We now introduce corresponding processes associated with the sequence of 
models indexed by n. We have 



,(*) = Qn(Q) + M* A (t) - M* >2 (t) + X n t - /i / Q n (s) ds , t > . (28) 



where 



M*M) = A{X n t)-X n t, 



) n (s) ds) — ix J Q n {s) ds, 



(29) 



The filtrations change with n in the obvious way 
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We now introduce the scaling, just as in ((3]). Let the scaled martingales be 

M* , (t) M*(t) 
M W|1 (t) = n ')2 and M W|2 (t) = n p , t>0. (30) 
V n V" 

Then, from l[2g |) -l|g0 jl . we get 

i(t) - n 



X n (t) = 



,(0)-n , KjW M l2(t) f f Qnjs) - n N t/ 



V n v n v n Jo V v n 

= X„(0) + M nA (t) - M n , 2 (t) - M / X„(s) ds , t > . (31) 

Jo 

Now we summarize this martingale representation for the scaled processes, 
depending upon the index n. Here is the implication of the analysis above: 

Theorem 3.4 (first martingale representation for the scaled processes) 
// E[Q n (0)] < oo for each n > 1, then the scaled processes X n in Q have 
the martingale representation 

X n (t) = X n (0) + M nil (t) - M n>2 (t) -pi [ X n (s)ds , t>0, (32) 



where M n ^ are given in (|29j) and ()30p . These processes M n ^ are square-integrable 
martingales with respect to the filtrations F n = {J- n .t ■ t > 0} defined by 



T n . t = a I Q n (0), j4(A n s), S \u J Q n (u) duj :0<s<tj , t > , 

augmented by including all null sets. Their associated predictable quadratic vari- 
ations are (M n ,i)(t) = X n t/n, t>0, and 



(M n>2 )(t) = £ / Q n (s)ds, t>0, (33) 
n Jq 

where E[(M n ^)(t)] < oo for all t > cmd n > 1. The associated optional 
quadratic variations are [M n ^i](t) — A(\ n t)/n, t > and 



S (m/ Q n (s) ds) 

[Afn, a ](*) = — ^ S t>0. 

n 

Note that X„ appears on both sides of the integral representation (|32|) . but 
X n (t) appears on the left, while X n {s) for < s < t appears on the right. In 
Sj4j] we show how to work with this integral representation. 
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3.5. Second Martingale Representation 

We can also start with the second sample-path construction and obtain an- 
other integral representation of the form (l32l) . 
Now we start with the martingales: 

M A {t) = A x (t)-Xt, 
Ms.Jt) = S^ k (t)-(it, 

M s (t) = J2 fhQis 

OO „f 
OO „ t 

= H / 1 {<3(« 

fc=l - 70 

so that, instead of (fT2|) . we have the alternate representation 

Q(*) - Q(0) + M A (t) - M s (t) + \t-n [ Q(s-) ds, , t > , (35) 

Jo 

where M4 and Ms are square-integrable martingales with respect to the filtra- 
tion F = {T t : t > 0} defined by 

T t = a{Q(0),A x (s),S^ k (s), k> 1:0 <s<t), t>0 , (36) 

again augmented by the null sets. Notice that this martingale representation is 
very similar to the martingale representation in (f!2Tj) . The martingales in (|2ip 
and ([35|) are different and the filtrations in (| 19|) and (|36|) are different, but 
the predictable quadratic variations are the same and the form of the integral 
representation is the same. Thus, there is an analog of Theorem 13.41 in this 
setting. 

We now provide theoretical support for the claims above. First, we put our- 
selves in the setting of Lemma 13.11 

Lemma 3.5 (a second integrable counting process with unit jumps) IfE[Q(Q)] < 
oo, then, in addition to being adapted to the filtration {Ft} in (|36|) . the stochastic 
process Y defined by 

oo „ t 

Y(t) = T / l {Q(s -)> fe } dS k (s), t > 0, (37) 
fe=i J ° 

is a unit-jump counting process such that E[Y(t)] < oo for all t > 0. 



oo „ t 

)>k) dS^kjs) — ^ / /il{Q(s-)>fc} ds, 
k=i Jo 

„t oo 

)>k} dS^k(s) - / Ml{Q(s-)>fc} ds, 
Jo ._ -, 



_)> fe } dS^k{s) - M / Q(s-)ds, 



k=l 
t 



(34) 
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Proof. It is immediate that Y is a counting process with unit jumps, but 
there is some question about integrability To establish integrability, we apply 
the crude inequality ([24]) to get 

oo „t oo 

y(t) <22 I l{Q(0)+A(At)>fc} dSk(s) < ^2 1 {Q(0)+A(\t)>k}Sk(t) , 

k=l 



^P(Q(0) + A(Ai) > k)E[S k (t)} 
fe=l 

oo 

^2 P(Q(0) + A(Xt) > k)fj,t 
LitE[Q{0) + A{Xt)} = nt(E[Q(0)] + Xt) < oo . ■ 

Given Lemmas 13.11 and 13.51 it only remains to identify the compensator of 
the counting process Y, which we call Y since A is used to refer to the arrival 
process. For that purpose, we can apply the integration theorem, as on p. 10 of 
Bremaud [12j . But our process Y in (|37[) is actually a sum involving infinitely 
many Poisson processes, so we need to be careful. 

Lemma 3.6 (identifying the compensator of Y) The compensator ofY in (I37| 
is given by 

y(i) = V/ l {Q{s -)> k} ^d S = ^ Q(s~)ds, t>0; (38) 
k=1 Ja Jo 

i.e., Y ~Y is an F -martingale for the filtration in (|36p . 

Proof. As indicated above, we can apply the integration theorem on p. 10 of 
Bremaud, but we have to be careful because Y involves infinitely many Pois- 
son processes. Hence we first consider the first n terms in the sum. With that 
restriction, since the integrand is an indicator function for each k, we consider 
the integral of a bounded predictable process with respect to the martingale 
{X}fe=i0^fc(^) — A 4 ^) : t ^ 0}; which is a martingale of "integrable bounded varia- 
tion," as required (and defined) by Bremaud. As a consequence, {Mg(t) : t > 0} 
is an F-martingale, where 

Ms(t) = J2 I l{Q(s-)>fc} dS Mjk (a) - V] / ^{Q(s~)>k} ds . (39) 
fe=i Jo fe=i Jo 

However, given that i£[Y(t)] < oo, we can apply the monotone convergence 
theorem to each of the two terms in (I39|) in order to take the limit as n — > oo to 
deduce that E[Y(t)] < oo and Ms itself, as defined in ([34]) . is an F-martingale, 
which implies that the compensator of Y in (|37f is indeed given by ([38]) . ■ 



so that 

E[Y(t)] < 

< 
< 
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By this route we obtain another integral representation for the scaled pro- 
cesses of exactly the same form as in Theorem 13.41 As before in (|2"8)) -(f3"l" j) . we 
introduce the sequence of models indexed by n. The martingales and nitrations 
are slightly different, but in the end the predictable quadratic variation processes 
are essentially the same. 

Theorem 3.5 (second martingale representation for the scaled processes) If 
E[Q n (0)] < oo for each n > 1, then the scaled processes X n in ^ have the 
martingale representation 

X n (t) = X n (0) + M nA (t) - M n . 2 (t) - fj. f X n (s)ds , t>0, (40) 

Jo 

where M n ^ are given in (|30p . but instead of (|29p . we have 
M* A (t) = A Xn (t)-X n t, 

°o pt pt 

MnA 1 ) = X] / 1 {Q n (s-)>k} dS vA S ) - AM Qn(s-)ds . 

k=1 Jo Jo 

These processes M n> i are square-integrable martingales with respect to the filtra- 
tions F n = {J- n .t '■ t > 0} defined by 

T n , t = o-{Q n (0),A Xn (s),S^ k (s)' k>l:0<s<t), t>0, 

augmented by including all null sets. Their associated predictable quadratic vari- 
ations are (M n ,i)(t) = X n t/n, t>0, and 

pt 

(M n , 2 }(t) = - / Q„(s)ds, t >0 , (41) 
n Jo 

where E[(M n ^)(t)] < oo for all t > and n > 1 and (M Hj i)(() = (\ n t/n) = fit. 
The associated optional quadratic variations are [M nt i](t) = A\ n (t)/n, t > 0, 
and 

[M n , 2 ](t) = ^^W-H^^W , t >0. (42) 

3.6. Third Martingale Representation 

We can also obtain a martingale representation for the stochastic process Q 
by exploiting the fact that the stochastic process Q is a birth-and-death process. 
We have the basic representation 

Q(t) = Q(0) + A{t) - D(t), t>0, 

where A is the arrival process and D is the departure process, as in (fT2"]) . Since 
Q is a birth-and-death process, we can apply the Levy and Dynkin formulas, 
as on p. 294 of Bremaud [12] to obtain martingales associated with various 
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counting processes associated with Q, including the counting processes A and 
D. Of course, A is easy, but the Dynkin formula immediately yields the desired 
martingale for D: 

M D (t) = D(t) - a f Q(s) ds, t > , 
Jo 

where the compensator of Mr> is just as in the first two martingale representa- 
tion, i.e., as in (J20J) , (27]). (34]) and (38]); see pp 6 and 294 of [12]. We thus again 
obtain the martingale representation of the form ([2T]) and ([35]) . Here, however, 
the filtration can be taken to be 

T t = a (Q(s) : < s < t) , t > . 

The proof of Theorem 11.11 is then the same as for the second representation, 
which will be by an application of the martingale FCLT in fJ5] 

3.7. Fourth Martingale Representation 

In this section, we present the martingale representation for the construction 
in terms of arrival and service times in ^2.31 but without any proofs. Consider 
a sequence of G/GI /oo queues indexed by n and let Q n (0), Q n , A n , and D n be 
the corresponding quantities in the n th queueing system, just as defined in £12.31 
For any cdf F, let the associated complementary cdf (cedf) be F c = 1 — F. 

Given representation (117|) . the Krichagina and Puhalskii [37] insight is to 
write the process Q n as 

Q»(o) 

Q n (t) = > *) " F o c W) + Qn(0)F^(t) 

i=i 

l( S |x>i)ffi„|^,i), (43) 
where 

L»*J 

K n (t,x) = -y^l(r)i<x), t>0, x>0 : (44) 
»=l 

is a sequential empirical process (a random field, having two parameters), so 
that 

(A (t) \ 1 

K n \^±±,x = - y 1(th <x), t>0, x>0. (45) 

v 7 i—l 

The division by n in pi]) provides a law-of-large-numbers (LLN) or fluid- 
limit scaling. To proceed, we define associated queueing processes with LLN 
scaling. In particular, define the normalized processes Q n = {Q n (t),t > 0}, 
A^ = {A n (t), t > 0} and D n = {D n (t), t > 0} as 
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Q n (t) = -Q„(t), A n {t) = -A n (t), D„{t) = -D n (t), t>0. (46) 

For our general arrival process, we assume that A n {t) — > a(t) = [it w.p.l as 
n — > co. For the M/M/oo special case, that follows from (|T|). 
Next write equation (14"5)) as 



- , x 



A„(t) 



<x)-F(x)) +-A n (t)F{x) 



+ -= [V»(An(t) - o(t))l F(s) + a(t)F(x), 
\ n L J 



A„(t) 



Now we introduce stochastic processes with central-limit-theorem (CLT) scal- 
ing. In particular, let 



A n (t) 



(=1 



and 
Then 



A n (t) = VE(A n (t) - a(t)). 



(47) 



tf n (A n (t),a:) = -Lv n (t,x) + -^=A n (t)F{x) + a(t)F{x), 
\/n Jn 



so that the process Q n in can be written as 



,(0) 



;.=i 



o jo 

t poo 



Q n (t) = E W** > *) " F o(*)) + Qn(0)F o c (t) 

l(s + x > t)dl/„(s, x) 
l(s + x> t)dA n (s)dF{x) 
l(s + x > t)da{s)dF{x) 

JO 

Q„(0) 

E (l(^>t)-F o c (t)) + Q„(0)F o c (t) 



-70 



(48) 



i=l 



l(s + x > t)dV n (s, x) 



Jo 

t 



+ y/n F c (t- s)dA n {s)+n F c {t-s)da{s) 



where 



and 
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Qn(0) .t 

V (1(7?, > *) - Fo(t)) + Qn(0)F^(t) + n F c (t- s)da(s) 
i=i 70 
+Vn(M n| i(i)-Af n , a (t)) , 



M n ,i(i) = / F c (t-s)dA n (s) (49) 
Jo 



M n , 2 (i) = l(s + x>t)dV n (s,x) 

Jo Jo 

f-t />oo 

= / / l(s + a;<t)dy n (s,x). (50) 
Jo Jo 

In contrast to previous representations, note that, except for Q ra (0) which 
can be regarded as known, Q n {s) for s < t does not appear on the righthand 
side of representation (|48p . Instead of the integral representations in Theorems 
3.41 and 13.51 here we have a direct expression of Q n {t) in terms of other model 
elements, but we will see that some of these model elements in turn do have 
integral representations. 

By equations (jl6) and 1(15]). we have 

Qn(t) = - V > t) - F c (i)) + Q n (0)F<(t) + / F c (t - s)da(s) 

n ti Jo 

+ -L(M n , 1 (t)-M„, 2 (i)), <>0. (51) 



From equation ([51]) . we can prove the following FWLLN. We remark that we 
could allow more general limit functions a for the LLN-scaled arrival process. 

Theorem 3.6 (FWLLN for the fourth martingale representation) If there is 
convergence (Q n (0), A n ) =>- (q(0),a) in R x D as n — > oo, where a(t) = jit. 
t > 0, then Q n q, where 

q(t) = q{0)F{j(t) + f F c (t - s)da(s), t > . (52) 
Jo 

For the M/M /oo special case, 

q(t) = q(0)e-^ + /j f e^^ds = 1 - (1 - g(0))e -/rf , t > 0. 
Jo 

If, in addition, q(0) — 1, then q(t) = 1 for t > 0. 
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Let the scaled process X n be defined by 

X n {t) = V^iQnit) - «(<)), t >0. (53) 

If q(t) = 1 for t > 0, then ([53")) coincides with (J3J). By equations (|55|) . (fSTj) and 
(|52|) , we obtain the following theorem for the scaled processes. 



Theorem 3.7 (fourth martingale representation for the scaled processes) The 
scaled process X n in (|53[) has the representation 

, Qn(0) 

X »W = ~m E > f ) - F o(*)) + V^(Qn(0) - g(0))F o c (i) 

V I— 1 

+M n ,i(t)-M n>2 (t), t>0, (54) 

where M n> i and M Ul 2 are defined as in (|49p and (|50|) . respectively. 

The situation is more complicated here, because the processes M^i and M nj 2 
in (|4T)1) and ([5H| are not naturally martingales for the G/GI/oo model or even 
the M/M/oo special case, with respect to the obvious filtration, but they can 
be analyzed by martingale methods. In particular, associated martingales can 
be exploited to establish stochastic-process limits. In particular, the proof of the 
FCLT for the processes X n in (|53|) - see £16.41 - exploits semimartingale decompo- 
sitions of the following two-parameter process U n = {U n (t, x), t > 0, < x < 1} 
(and related martingale properties): 

\nt\ 

U n (t,x)= V(l(Ci , (55) 

where the ("i are independent and uniformly distributed on [0, 1]. 

Extending Bickel and Wichura |7J , Krichagina and Puhalskii [37] proved that 
the sequence of processes {Un,n > 1} converges in distribution to the Kiefer 
process U in D([0, oo), -D([0, 1])). For properties of Kiefer processes, we refer 
to Csorgo M. and P. Revez [13] and Khoshnevisan [35]. The importance of the 
Kiefer process for infinite-server queues was evidently first observed by Louchard 

The process U n has the following semimartingale decomposition (See Chapter 
IX of Jacod and Shiryaev [3U]): 

U n (t,x)=- [ v) dy + M n . (t,x), 

1-y 



o 



where 



1 ^ / r ACl 1 
M n , (*,s) = -j= V (l(C< <x)- / , dy 



is a square- integrable martingale relative to the filtration F n = Vi<|nt| J~" z (x) 
and .T^fx) = a(l(Ci < y),0 < y < x) V AT for all x € [0, 1]. 
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Hence V n (t, x) = U n (a n (t), F(x)) can be written as 

V n {t, x) = - jT l n Sp^\ dF{y) + L n (t, x) , 



where 

M*, *) - ^ g (ifo < *) - X ir^^). 

F-{y) is the left-continuous version of F, F_(0) = and V n ,- is the left- 
continuous version of V n in the second argument. Therefore, M n ,2 can be written 
as 

M n , 2 (t) = G n (t) + H n (t) , 

where 

rt V n -(t - x,x) 



1 - F-{x) 



-dF{x) 



and 

ft poo 



H n (t) = / / l(s + x<t)dL n (s,x). 
Jo Jo 

In closing this subsection, we remark that an associated representation holds 
for the two-parameter process Q(t,y) in (fT7|) . Let the associated scaled two- 
parameter process be defined by 

X n (t,y) = y/K(Q n (t,y)-q(t,y)), t > 0, (56) 

where Q n (t, y) = Q n {t, y)/n, Q n =>■ 9 as n — > oo and 

?(t, y) = g(0)F o c (i) + / " F c (i - s) da(«) . (57) 
Jo 

Corollary 3.1 (associated representation for the scaled two-parameter processes) 
Paralleling (|54|) . i/ie scaled process in (I56[) /ias i/ie representation 

1 Qn(0) 

x n (t, y) = ^ E > *) - F o(*)) + V^(Q«(0) - g(0))F o c (t) 

+M n ,i(t,y)-M n , 2 (t,y), t > 0, (58) 
where, paralleling (|49|) and (|50[) . 



and 



M„,i(t,y) = [ V F c (t-s)dA n (s) 



pt—y poo 

M n , 2 (t,y) = / / l(s + x<t)dV n (s,x). 
Jo Jo 
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4. Main Steps in the Proof of Theorem 11.11 

In this section we indicate the main steps in the proof of Theorem 11.11 starting 
from one of the first three martingale representations in the previous section. 
First, in i j4.ll we show that the integral representation appearing in both The- 
orems 13.41 and 13.51 has a unique solution, so that it constitutes a continuous 
function from D to D. Next, in ^4.21 we show how the limit can be obtained 
from the functional central limit theorem for the Poisson process and the con- 
tinuous mapping theorem, but a fluid limit (Lemma l4.2l or Lemma |4.3|) remains 
to be verified. In £14.31 we show how the proof can be completed without martin- 
gales by directly establishing that associated fluid limit. In §fj5][6]we show how 
martingales can achieve the same result. In §6.41 we indicate how to complete 
the proof with the fourth martingale representation. 

4- 1. Continuity of the Integral Representation 

We apply the continuous-mapping theorem (CMT) with the integral 
representation in (f3"2| and ((4TJ|) in order to establish the desired convergence; for 
background on the CMT, see §3.4 of 69J. In subsequent sections we will show 
that the scaled martingales converge weakly to independent Brownian motions, 
i.e., 

(M„,i, M„ >2 ) =*> (V/LBi, V/LB 2 ) in D 2 = D x D as n -> oo , (59) 

where B\ and B 2 are two independent standard Brownian motions, from which 
an application of the CMT with subtraction yields 

M„,i - M n .2 => ■sfiLBx - ^B 2 = y/2^B in D as n -> 00 , (60) 

where B is a single standard Brownian motion. 

We then apply the CMT with the function / : D xl-> D taking (y, b) into 
x determined by the integral representation 

x(t) =b + y(t) - a f x(s)ds, t>0. (61) 
Jo 

In the pre-limit, the function y in (f6"Tj) is played by M n< % — M n> 2 = {M n< i(t) — 
M n< 2(t) : t > 0} in (|60|). while b is played by X n (0). In the limit, the function 
y in (fBTI) is played by the limit y/2jlB in ([60]) . while b is played by X(0). (The 
constant b does not play an essential role in (|6 1 [) : it is sometimes convenient 
when we want to focus on the solution 1 as a function of the initial conditions.) 

For our application, the limiting stochastic process in (|60l) has continuous 
sample paths. Moreover, the function / in ([61]) maps continuous functions into 
continuous functions, as we show below. Hence, it suffices to show that the 
map / : D x M — > D is measurable and continuous at continuous limits. Since 
the limit is necessarily continuous as well, the required continuity follows from 
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continuity when the function space D appearing in both the domain and the 
range is endowed with the topology of uniform convergence on bounded inter- 
vals. However, if we only establish such continuity, then that leaves open the 
issue of measurability. It is significant that the a field on D generated by the 
topology of uniform convergence on bounded intervals is not the desired cus- 
tomary a field on D, which is generated by the coordinate projections or by 
any of the Skorohod topologies; see §11.5 of [55] and §18 of Billingsley [Sj. We 
prove measurability with respect to the appropriate a field on D (generated by 
the Ji topology) by proving continuity when the function space D appearing 
in both the domain and the range is endowed with the Skorohod J± topology. 
That implies the required measurability. At the same time, of course, it provides 
continuity in that setting. 

We now establish the basic continuity result. We establish a slightly more 
general form than needed here in order to be able to treat other cases. In par- 
ticular, we introduce a Lipschitz function h : R — > R; i.e., we assume that there 
exists a constant c > such that 



We apply the more general form to treat the Erlang A model in £17.11 Theorem 
17.31 in i)7. 21 involves an even more general version in which h : D — ► D. 

Theorem 4.1 (continuity of the integral representation) Consider the integral 
representation 



where h : R — > R satisfies h(0) = and is a Lipschitz function as defined in 
(|62[) . The integral representation in (|63p has a unique solution x, so that the 
integral representation constitutes a function f : D x R — > D mapping (y, b) into 
x = f(y, b). In addition, the function f is continuous provided that the function 
space D (in both the domain and range) is endowed with either: (i) the topology 
of uniform convergence over bounded intervals or (ii) the Skorohod J\ topology. 
Moreover, if y is continuous, then so is x. 

Proof. If y is a piecewise-constant function, then we can directly construct 
the solution x of the integral representation by doing an inductive construction, 
just as in Lemma |2. II Since any element y of D can be represented as the limit 
of piecewise-constant functions, where the convergence is uniform over bounded 
intervals, using endpoints that are continuity points of y, we can then extend the 
function / to arbitrary elements of D, exploiting continuity in the topology of 
uniform convergence over bounded intervals, shown below. Uniqueness follows 
from the fact that the only function x in D satisfying the inequality 



\h(si) — h(s2)\ < c\si — s 2 \ for all si, s 2 € R . 



(62) 




(63) 
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is the zero function, which is a consequence of Gronwall's inequality, which we 
re-state in Lemma T4. II below in the form needed here. 

For the remainder of the proof, we apply Gronwall's inequality again. We 
introduce the norm 



Ft 



sup \x{t)\ 

0<t<T 



First consider the case of the topology of uniform convergence over bounded 
intervals. We need to show that, for any e > 0, there exists a 6 > such that 
\\xi — x 2 \\t < e when \bi — b 2 \ + \ \yi — y 2 \\r < S, where (y t ,x t ) are two pairs of 
functions satisfying the relation (|6"3")h From we have 



\xi(t) - x 2 (t)\ < \h - h\ + \yi(t) - y 2 {t)\ + \h{x 1 (s))-h{x 2 (s))\ds , 



< |&i-&2| + |yi(t)-Ifc(t)|+c / \xi{s)-x 2 (s)\ds . (64) 

Jo 

Suppose that \b\ — b 2 \ + \ \yi — ij 2 \\t < S. By Gronwall's inequality, 

\xi{t) - x 2 (t)\ < Se ct and | \ Xl - x 2 \ \ T < Se cT . 

Hence it suffices to let 5 = ee~ cT . 

We now turn to the Skorohod J\ topology; see §§3.3 and 11.5 and Chapter 12 
of [69] for background. To treat this non-uniform topology, we will use the fact 
that the function x is necessarily bounded. That is proved later in Lemma 15.51 
We want to show that x n — > x in -D([0, oo), R, J\) when b n —> b in R and y n — > y 
in D([0, oo), R, J\). For y given, let the interval right endpoint T be a continuity 
point of y. Then there exist increasing homcomorphisms A n of the interval [0, T] 
such that \\y n — y \i\\T — * and |jA„ — e||x — > as n — > oo. Moreover, it suffices 
to consider homcomorphisms A„ that arc absolutely continuous with respect to 
Lebesgue measure on [0,T] having derivatives A„ satisfying ||A„ — 1||t — * as 
n — > oo. The fact that the topology is actually unchanged is a consequence 
of Billingsley's equivalent complete metric do on pp 112-114 of Billingsley 8 . 
Hence, for y given, let M = sup 0<t<T {|x(t)|}. Since h in ([62f is Lipschitz, we 
have 

sup {\h(x(t))\} < h(0) + sup {\h(x(t)) - h(0)\} < h(0) + cM = cM . 

0<t<T 0<t<T 

Thus we have 



\x n (t) - x(X n (t))\ < \b n - b\ + \\y n - y o \ n \\ T 

rt /-A„(t) 

h{x n {u))du— / h(x(u))du 



< \b n - b\ + \\y n - yo \ n \\ T 

h(x n {u))du— I h(x(X n (u)))X n (u) du 
io Jo 
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< \b n - b\ + \\y n - y o A„||t + ||A„ - 1||t / \h(x(u))\ du 

Jo 

+ \h(x n (u)) - h(x(\ n (u)))\du 
Jo 

< K ~ b\ + \\y n - V o A„|| T + || A„ - 1\\t{cMT) 

+c / |a; n (ti) - x(A„(it))| du . 
Jo 

Choose n such that ||A n -l||r < S/(2cMT) and \b n -b\ + \\y n ~yo\ n \\ T < 6/2. 
Then Gronwall's inequality implies that 

\x n (t) - x(X n (t))\ < 6e ct for all t, < t < T , 

so that 

|| x„ -xo \ n \\ T < Se cT . 

Hence, for e > given, choose 8 < ee~ cT to have ||a;„ — xo \ n \\ T < e for n > n . 
If necessary, choose n larger to make ||A„ — e\\r < e and ||A„ — 1||t < e as well. 
Finally, for the inheritance of continuity, note that 

l-t+S 

x(t + s) - x(t) = y(t + s)- y(t) + J h(x(u)) du , 

so that 

l-t+S 

\x(t + s) - x(t)\ < \y(t + s) - y(t)\ + J \h(x(u))\ du . 

Since x is bounded over [0, T], x is continuous if y is continuous. ■ 

In our case we can simply let h(s) = /is, but we will need the more compli- 
cated function h in ([63|) and (|62|) in $7.1{ To be self-contained, we now state 
a version of Gronwall's inequality; see p. 498 of [TI5]. See §11 of [35] for other 
versions of Gronwall's inequality. 

Lemma 4.1 (version of Gronwall's inequality) Suppose that g : [0, oo) — > [0, oo) 
is a Borel-measurable function such that 

0<g(t)<e + M [ g(s)ds, < i < T , 
Jo 

for some positive finite e and M . Then 

g(t) < ee Mt , < t < T . 

It thus remains to establish the limit in ((59|) . Our proof based on the first 
martingale representation in Theorem 13 . 41 relies on a FCLT for the Poisson pro- 
cess and the CMT with the composition map. The application of the CMT with 
the composition map requires a fluid limit, which requires further argument. 
That is contained in subsequent sections. 
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4.2. Poisson FCLT Plus the CMT 

As a consequence of the last section, it suffices to show that the scaled mar- 
tingales converge, as in (|59|) . From the martingale perspective, it is natural to 
achieve that goal by directly applying the martingale FCLT, as in §7.1 of Ethier 
and Kurtz [19] . and as reviewed here in <j8j and that works. In particular, the 
desired limit (|59| follows from Theorems 13.41 and 18.11 (ii) (or Theorems 13.51 and 
18.11 (ii)) plus Lemma [4.21 below. Lemma [4.21 shows that the scaled predictable 
quadratic variation processes in (|33|) and (|41| converge, as required in condition 
(fT33l) of Theorem O here; see W2\ 

However, starting with the first martingale representation in Theorem l3.41 we 
do not need to apply the martingale FCLT. Instead, we can justify the martin- 
gale limit in (|59|) by yet another application of the CMT, using the composition 
map associated with the random time changes, in addition to a functional central 
limit theorem (FCLT) for scaled Poisson processes. Our approach also requires 
establishing a limit for the sequence of scaled predictable quadratic variations 
associated with the martingales, so the main steps of the argument become the 
same as when applying the martingale FCLT. 

The FCLT for Poisson processes is a classical result. It is a special case of 
the FCLT for a renewal process, appearing as Theorem 17.3 in Billingsley 8 . 
It and generalizations are also discussed extensively in [59] : see §§6.3, 7.3, 7.4, 
13.7 and 13.8. The FCLT for a Poisson process can also be obtained via a strong 
approximation, as was done by Kurtz [39] , Mandelbaum and Pats [47] [48] and 
Mandelbaum, Massey and Reiman [46]. Finally, the FCLT for a Poisson process 
itself can be obtained as an easy application of the martingale FCLT, as we 
show in $8] 

We start with the scaled Poisson processes 

, . A(nt) — nt , , , . . S(nt) — nt „ . . 

M A>n {t) = -±-L= and M s ,„(t) = -±-L= , t > . (65) 

v n V n 

We employ the following basic FCLT: Since A and S are independent rate-1 
Poisson processes, we have 

Theorem 4.2 (FCLT for independent Poisson processes) If A and S are inde- 
pendent rate-1 Poisson processes, then 

(M Atn ,M s , n ) => {B!,B 2 ) in D 2 = D x D as n ->■ 00 , (66) 

where M A , n and Ms, n ore the scaled processes in (|65p , while B\ and B2 are 
independent standard Brownian motions. 

We can prove the desired limit in ([59"]) for both martingale representations, 
but we will only give the details for the first martingale representation in The- 
orem 13.41 In order to get the desired limit in (|59p , we introduce a deterministic 
and a random time change. For that purpose, let e : [0,oo) — > [0, 00) be the 
identity function in D, defined by e(t) = t for t > 0. Then let 

#A,n(*) = — = ut = (fie)(t), 
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$ Sjn (t) = !=■ / Q n (s)ds, t>0. (67) 

We will establish the following fluid limit, which can be regarded as a func- 
tional weak law of large numbers (FWLLN). Here below, and frequently later, 
we have convergence in distribution to a deterministic limit; that is equivalent 
to convergence in probability; see p. 27 of [5]. 

Lemma 4.2 (desired fluid limit) Under the conditions of Theorem II. 11 

&S,n A*e in D as n — > oo , (68) 

where $s jn is defined in (|67[) . 

For that purpose, it suffices to establish another more basic fluid limit. Con- 
sider the stochastic process 

9 s , n (t) = ^-, t>0. (69) 

Let uj be the function that is identically 1 for all t. 

Lemma 4.3 (basic fluid limit) Under the conditions of Th eo rem 11.11 

^s.n =>■ w -D as n — ► oo , (70) 

where ^ s.n * s defined in (I69[) cmd = 1, t > 0. 

Proof of Lemma 14.21 The desired fluid limit in Lemma 14.21 follows from 
the basic fluid limit in Lemma 14.31 by applying the CMT with the function 
h : D — > D defined by 

h(x)(t) = n[ x(s)ds, t>0. ■ (71) 
Jo 

We thus have the following result 
Lemma 4.4 (all but the fluid limit) If the limit in (|70| holds, then 

(M nA ,M n , 2 )^(y/JlB 1 ,^JlB 2 ) in D 2 (72) 

as required to complete the proof of Theorem II. U 

Proof. From the limit in ([55]) . the desired fluid limit in ([58]) and Theorem 
11.4.5 of [55], it follows that 

(M Atn ,iie,M s , n ,$s,n) (B 1 , f ie,B 2 ,fie) in D 4 = D x • ■ • x D (73) 

as n — > oo. From the CMT with the composition map, as in §§3.4 and 13.2 of 
[55] - in particular, with Theorem 13.2.1 - we obtain the desired limit in ([55]) : 

(Af n ,i, M n , 2 ) = (M Atn o fie, M s , n o $s,n) => (#i o /xe, B 2 o /xe) in L> 2 (74) 
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asn->oo. By basic properties of Brownian motion, 

(Bi o fj,e, B 2 o fj,e) = {*/jiBi,*/jiB2) in D 2 . ■ 

It thus remains to establish the key fluid limit in Lemma 14.31 In the next 
section we show how to do that directly, without martingales, by applying the 
continuous mapping provided by Theorem l4.1l in the fluid scale or, equivalently, 
by applying Gronwall's inequality again. We would stop there if we only wanted 
to analyze the M/M/oo model, but in order to illustrate other methods used 
in Krichagina and Puhalskii 57] and Puhalskii and Reiman [S3] , we also apply 
martingale methods. Thus, in the subsequent four sections we show how to es- 
tablish that fluid limit using martingales. Here is an outline of the remaining 
martingale argument: 

(1) To prove the needed Lemma T4.31 it suffices to demonstrate that {X n } is 
stochastically bounded in D. (Combine Lemma T5 .91 and §6.11 ) 

(2) However, {X n } is stochastically bounded in D if the sequences of martin- 
gales {M nj i} and {M n ^2] are stochastically bounded in D. (Combine Theorem 
13.41 and Lemma WM ) 

(3) But then the sequences of martingales {M n l \ and {M„ 2 } are stochasti- 
cally bounded in D if the associated sequences of predictable quadratic varia- 
tions {(M n i)(i)} and {(M n 2 )(t)} are stochastically bounded in M for each t > 
(Apply Lemma \5. 8 1 One of these is trivial because it is deterministic.) 

(4) Finally, we establish stochastic boundedness of {(M n ^){t)} (the one non- 
trivial case) through a crude bound in §6.21 

This alternate route to the fluid limit is much longer, but all the steps might 
be considered well known. We remark that the fluid limit seems to be required 
by any of the proofs, including by the direct application of the martingale FCLT. 

4-3. Fluid Limit Without Martingales 

In this section we prove Lemma 14.31 without using martingales. We do so 
by establishing a stochastic-process limit in the fluid scale which is similar to 
the corresponding stochastic-process limit with the more refined scaling. This 
is a standard line of reasoning for heavy-traffic stochastic-process limits; e.g., 
see the proofs of Theorems 9.3.4, 10.2.3 and 14.7.4 of Whitt [69]. The specific 
argument here follows §6 of Mandelbaum and Pats [47] . With this approach, even 
though we exploit the martingale representations, we do not need to mention 
martingales at all. We are only applying the continuous mapping theorem. 

By essentially the same reasoning as in £13.41 we obtain a fluid-scale analog 
of (33} and (32]): 
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Q n (p)-n , M* tl (t) M* 2 (t) \ n t-nnt 



n n 



= X n (Q) + M nil (t) - M«, a (t) - /i / *»(«) da , * > , (75) 

Jo 

where 

M*,(t) _ M*M) 

M nA (t) = "' 1W and M n , 2 (t)= "' 2W , i>0, (76) 
n n 

with M* A (t) defined in 1(25)1 . 

Notice that the limit 

X n => r\ in D k as n — > oo , (77) 

where 

T]{t) = 0, t > , (78) 



is equivalent to the desired conclusion of Lemma 14.31 Hence we will prove the 
fluid limit in (fTF]) . 

The assumed limit in ([JJ implies that X n (Q) in R as n — > oo. We can 
apply Theorem 14.11 or directly GronwalPs inequality in Lemma 14.11 to deduce 
the desired limit (|77|) if we can establish the following lemma. 



Lemma 4.5 [fluid limit for the martingales) Under the conditions of Theorem 

El 

Mn.i V in D w.p.l as n — * oo , (79) 
for i = 1,2, where M n ^ is defined in (|76|) and rj is defined in (|78|). 



Proof of Lemma l4.5l We can apply the SLLN for the Poisson process, which 
is equivalent to the more general functional strong law of large numbers (FS- 
LLN); see §3.2 of [70]. (Alternatively, we could apply the FWLLN, which is a 
corollary to the FCLT.) First, the SLLN for the Poisson process states that 

Mt) , , S(t) 

> 1 and > 1 w.p.l as t — > oo , 

* t 

which implies the corresponding FSLLN's 

r A(nt) , r-Sfni) , , , 

sup {— K —-t}^0 and sup {^-^-t}^0 w.p.l (80) 
o<t<r n o<t<r n 

as n — > oo for each T with < T < oo. We thus can treat M nj i directly. To treat 
M n> 2, we combine (|80|) with the crude inequality in (|25l) and the representation 
in (|75|) - ((76l) in order to obtain the desired limit (|79|) . To elaborate, the crude 
inequality in ((25|) implies that, for any Ti > 0, there exists T 2 such that 

P ( - J Qn(s) ds > T 2 ^ -> as n ->• oo . 



G. Pang, R. Talreja and W. Whitt/ Martingale Proofs 



232 



That provides the key, because 



P(||M„, 2 || Tl > e) <P 




i \ 

Q n (s)ds > T 2 +P(\\S n \\ T2 > e/2) 



where 



S n (t) 



S(nt) — nt 



t > . ■ 



n 



5. Tightness and Stochastic Boundedness 
5. 1 . Tightness 

As indicated at the end of i)4.21 we can also use a stochastic-boundedness 
argument in order to establish the desired fluid limit. Since stochastic bound- 
edness is closely related to tightness, we start by reviewing tightness concepts. 
In the next section we apply the tightness notions to stochastic boundedness. 
The next three sections contain extra material not really needed for the current 
proofs. Additional material on tightness criteria appears in Whitt [73) . 

We work in the setting of a complete separable metric space (CSMS), also 
known as a Polish space; see §§13 and 19 of Billingsley [8], §§3.8-3.10 of Ethicr 
and Kurtz [19 and §§11.1 and 11.2 of [BJ- (The space D k = D([0, oo), R) k is 
made a CSMS in a standard way and the space of probability measures on D k 
becomes a CSMS as well.) Key concepts are: closed, compact, tight, relatively 
compact and sequentially compact. We assume knowledge of metric spaces and 
compactness in metric spaces. 

Definition 5.1 (tightness) A set A of probability measures on a metric space 
S is tight if, for all e > 0, there exists a compact subset K of S such that 



A set of random elements of the metric space S is tight if the associated set of 
their probability laws on S is tight. Consequently, a sequence {X n : n > 1} of 
random elements of the metric space S is tight if, for all e > 0, there exists a 
compact subset K of S such that 



Since a continuous image of a compact subset is compact, we have the fol- 
lowing lemma. 

Lemma 5.1 (continuous functions of random elements) Suppose that {X n : 
n > 1} is a tight sequence of random elements of the metric space S . If f : S — > 
S' is a continuous function mapping the metric space S into another metric 
space S' , then {f(X n ) : n > 1} is a tight sequence of random elements of the 
metric space S' . 



P(K) > 1 - e for all P eA . 



P(X n eK)>l-e for all n > 1 . 
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Proof. As before, let o be used for composition: (/ o g){x) = f(g(x)). For any 
function / : S — ► S' and any subset A of S, A C f~ l o f(A). Let e > be given. 
Since {X n : n > 1} is a tight sequence of random elements of the metric space 
S, there exists a compact subset if of S such that 

P(X„ G K) > 1 - e for all n > 1 . 

Then /(if) will serve as the desired compact set in S", because 

e f(K)) = P(x n e CT 1 o /)(«■)) > P(x n e X) > l - e 

for all n > 1. ■ 

We next observe that on products of separable metric spaces tightness is 
characterized by tightness of the components; see §11.4 of [69] . 

Lemma 5.2 (tightness on product spaces) Suppose that {(X n< x, . . . , X ni k) ■ 
n > 1} is a sequence of random elements of the product space Si x • • • x 
Sk, where each coordinate space Si is a separable metric space. The sequence 
{(X n> i, . . . ,X n< k) ■ n > 1} is tight if and only if the sequence {X n ^ : n > 1} is 
tight for each i, 1 < i < k. 

Proof. The implication from the random vector to the components follows 
from Lemma 15.11 because the component X n j is the image of the projection 
map TTi : S± x • • • x Sk — * Si taking (xi, . . . , Xk) into Xi, and the projection map 
is continuous. Going the other way, we use the fact that 

k k 

Ax X • • • X A k = p| vrr 1 ^) = f] ir^ 1 n (A x x ■ ■ ■ x A k ) 

t=l i=l 

for all subsets A^ C Si. Thus, for each i and any e > 0, we can choose Ki such 
that P(X n< i £ Ki) < e/k for all n > 1. We then let K\ x •• • x Kk be the desired 
compact for the random vector. We have 

P {{Xn,!, X n>k ) £K!X---xK k ) - P (\J{X n>i i Ki^j 

k 

< ^P(X nti <£Ki)<e . . 

i=l 

Tightness goes a long way toward establishing convergence because of Pro- 
horov's theorem. It involves the notions of sequential compactness and relative 
compactness. 

Definition 5.2 (relative compactness and sequential compactness) A subset A 
of a metric space S is relatively compact if every sequence {x n : n > 1} from 
A has a subsequence that converges to a limit in S {which necessarily belongs to 
the closure A of A). 
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We can now state Prohorov's theorem; see §11.6 of [69]. It relates compactness 
of sets of measures to compact subsets of the underlying sample space S on which 
the probability measures are defined. 

Theorem 5.1 (Prohorov's theorem) A subset of probability measures on a CSMS 
is tight if and only if it is relatively compact. 

We have the following elementary corollaries: 

Corollary 5.1 (convergence implies tightness) If X n X as n — > oo for 

random elements of a CSMS, then the sequence {X n : n > 1} is tight. 

Corollary 5.2 (individual probability measures) Every individual probability 
measure on a CSMS is tight. 

As a consequence of Prohorov's Theorem, we have the following method for 
establishing convergence of random elements: 

Corollary 5.3 (convergence in distribution via tightness) Let {X n : n > 1} be 

a sequence of random elements of a CSMS S. We have 

X n X in S as n — > oo 

if and only if (i) the sequence {X n : n > 1} is tight and (ii) the limit of every 
convergent subsequence of {X n : n > 1} is the same fixed random element X 
(has a common probability law). 

In other words, once we have established tightness, it only remains to show 
that the limits of all converging subsequences must be the same. With tightness, 
we only need to uniquely determine the limit. When proving Donsker's theorem, 
it is natural to uniquely determine the limit through the finite-dimensional dis- 
tributions. Convergence of all the finite-dimensional distributions is not enough 
to imply convergence on D, but it does uniquely determine the distribution of 
the limit; see pp 20 and 121 of Billingsley [5] and Example 11.6.1 in [55] . 

This approach is applied to prove the martingale FCLT stated in §S] see [73] . 
In the martingale setting it is natural instead to use the martingale character- 
ization of Brownian motion, originally established by Levy [42] and proved by 
Ito's formula by Kunita and Watanabe [38]; see p. 156 of Karatzas and Shreve 
[33] . and various extensions, such as to continuous processes with independent 
Gaussian increments, as in Theorem 1.1 on p. 338 of Ethier and Kurtz [TI5]. A 
thorough study of martingale characterizations appears in Chapter 4 of Liptscr 
and Shiryayev [44] and in Chapters VIII and IX of Jacod and Shiryayev [30]. 

We have not discussed conditions to have tightness; they are reviewed in [73] , 

5.2. Stochastic Boundedness 

We start by defining stochastic boundedness and relating it to tightness. 
We then discuss situations in which stochastic boundedness is preserved. Af- 
terwards, we give conditions for a sequence of martingales to be stochastically 
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bounded in D involving the stochastic boundedness of appropriate sequences 
of R- valued random variables. Finally, we show that the FWLLN follows from 
stochastic boundedness. 

5.2.1. Connection to Tightness 

For random elements of R and R fc , stochastic boundedness and tightness are 
equivalent, but tightness is stronger than stochastic boundedness for random 
elements of the functions spaces C and D (and the associated product spaces 
C k and D k ). 

Definition 5.3 (stochastic boundedness for random vectors) A sequence {X n : 
n > 1} of random vectors taking values mR fe is stochastically bounded (SB) 

if the sequence is tight, as defined in Definition 15.11 

The notions of tightness and stochastic boundedness thus agree for random 
elements of R fc , but these notions differ for stochastic processes. For a function 
x eD k = L>([0,oo),R) fe , let 

||x|| T = sup {\x(t)\} , 

0<t<T 

where |6| is a norm of b = (b\, 62, ■ ■ ■ , &fc) in K fe inducing the Euclidean topology, 
such as the maximum norm: \b\ = max I&2I, • ■ ■ , |^fc|}- (Recall that all norms 
on Euclidean space K fc are equivalent.) 

Definition 5.4 (stochastic boundedness for random elements of D k ) A se- 
quence {X n : n > 1} of random elements of D k is stochastically bounded 

in D k if the sequence of real-valued random variables {||X„||t : n > 1} is 
stochastically bounded in K for each T > 0, using Definition ^. Z\ . 

For random elements of D k , tightness is a strictly stronger concept than 
stochastic boundedness. Tightness of {X n } in D k implies stochastic bounded- 
ness, but not conversely; see §15 of Billingsely [8J. However, stochastic bound- 
edness is sufficient for us, because it alone implies the desired fluid limit. 

5.2.2. Preservation 

We have the following analog of Lemma 15.21 which characterizes tightness for 
sequences of random vectors in terms of tightness of the associated sequences 
of components. 

Lemma 5.3 (stochastic boundedness on D k via components) A sequence 

{(X n>1 , X nM ) : n > 1} in D k = D x • ■ ■ x D 

is stochastically bounded in D k if and only if the sequence {X n: i : n > 1} is 
stochastically bounded in D = D 1 for each i, 1 < i < k. 
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Proof. Assume that we are using the maximum norm on product spaces. We 
can apply Lemma 15.21 after noticing that 

|| (xi, . . .,x k )\\T = max{\\xi\\ T :l<i<k} 

for each element (x\, . . . , x k ) of D k . Since other norms are equivalent, the result 
applies more generally. ■ 

Lemma 5.4 (stochastic boundedness in D k for sums) Suppose that 

Y n {t) = X nA (t) + ■■■+ X ntk (t), t > 0, 

for each n > 1, where {(X ni i, . . . , X Urk ) : n > 1} is a sequence of random 
elements of the product space D k = D x • • • x D. If {X ny i : n > 1} is stochas- 
tically bounded in D for each i, 1 < i < k, then the sequence {Y n : n > 1} is 
stochastically bounded in D. 

Note that the converse is not true: We could have k — 2 with X n ^{t) = 
—X n ^(t) for all n and t. In that case we have Y n (t) = for all X n ^i(t). 

We now provide conditions for the stochastic boundedness of integral repre- 
sentations such as (J22). 

Lemma 5.5 (stochastic boundedness for integral representations) Suppose that 

X n (t) = X n (0) + y n ,i(t) + ■ • ■ + Y n>k {t) + [ h(X n (s)) ds, t>0, 

Jo 

where h is a Lipschitz function as in (|62[) and (X n (0), (Y ni i, . . . , Y njk )) is a 
random element of K x D k for each n > 1. If the sequences {X n (0) : n > 1} 
and {Y n ,i : n > 1} are stochastically bounded (in K. and D, respectively,) for 
1 < i < k, then the sequence {X n : n > 1} is stochastically bounded in D. 

Proof. For the proof here, it is perhaps easiest to imitate the proof of Theorem 
14.11 In particular, we will construct the following bound 

\\X n \\ T <Ke cT , 

assuming that \X n (0)\ +||F„,i||r H h ||F njfe || T +T|/i(0)| < K. As before, we 

apply Gronwall's inequality, Lemma [4.11 Inserting absolute values into all the 
terms of the integral representation, we have 

\X n {t)\ < \X n (0)\ + \Y ntl {t)\+--- + \Y n>k (t)\+ [ \h(X n {s))\ds, 

Jo 

< \X n {{))\ + \\Y n . 1 \\ T + --- + \\Y nM \\ T + Th{Q) + c ( \X n (s)\ds, 

Jo 

for t > 0, where c is the Lipschitz modulus. Suppose that |A„(0)| + ||1^,i||t + 
1- ||^n,fc||T + T\h(0)\ < K. By Gronwall's inequality, 

\X n {t)\<Ke ct and \\X n \\ T < Ke cT . m 
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5.2.3. Stochastic Boundedness for Martingales 



We now provide ways to get stochastic boundedness for sequences of martingales 
in D from associated sequences of random variables. Our first result exploits 
the classical submartingale-maximum inequality; e.g., see p. 13 of Karatzas and 
Shreve [33]. We say that a function / : R — > R is even if f(—x) — f(x) for all 
x G R. 

Lemma 5.6 (SB from the maximum inequality) Suppose that, for each n > I, 
M n = {M n (t) : t > 0} is a martingale (with respect to a specified filtration) 
with sample paths in D. Also suppose that, for each T > 0, there exists an even 
nonnegative convex function f : R — ► R with first derivative f'(t) > for t > 
(e.g., f(t) = t 2 ), there exists a positive constant K = K(T, f), and there exists 
an integer uq = no(T, /, K), such that 

E[f(M n (T))\ < K for all n > n . 

Then the sequence of stochastic processes {M n : n > 1} is stochastically bounded 
in D. 



Proof. Since any set of finitely many random elements of D is automatically 
tight, Theorem 1.3 of Billingsley |8J, it suffices to consider n > uq. Since / is 
continuous and f'(t) > for t > 0, t > c if and only if f(t) > /(c) for t > 0. 
Since / is even, 

E[f(M n (t))\ = E[f(\M n (t)\)\ < E[f(\M n (T)\)\ = E[f(M n (T))\ < K 

for alii, < t < T . Since these moments are finite and / is convex, the stochastic 
process {f(M n (t)) : < t < T} is a submartingale for each n > 1, so that we 
can apply the submartingale-maximum inequality to get 

P(\\M n \\ T > c) = P(||/o M n || T > /(c)) < ^ [/( ^" (r))] < K 



f(c) ~ /(c) 

for all n > uq. Since /(c) — > oo as c — > oo, we have the desired conclusion. ■ 

We now establish another sufficient condition for stochastic boundedness of 
square-integrable martingales by applying the Lenglart-Rebolledo inequality; 
see p. 66 of Liptser and Shiryayev [44] or p. 30 of Karatzas and Shreve (33] . 

Lemma 5.7 (Lenglart-Rebolledo inequality) Suppose that M = {M(t) : t > 0} 
is a square-integrable martingale (with respect to a specified filtration) with pre- 
dictable quadratic variation (M) = {(M)(t) : t > 0}, i.e., such that M 2 — (M) = 
{M(t) 2 — (M)(t) : t > 0} is a martingale by the Doob-Meyer decomposition. 
Then, for all c> and d > 0, 

P ( sup {\M(t)\} >c)< ±+P((M)(T) > d) . (81) 
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As a consequence we have the following criterion for stochastic boundedness 
of a sequence of square- integrable martingales. 

Lemma 5.8 (SB criterion for square-integrable martingales) Suppose that, for 
eachn > 1, M n = {M n (t) :t> 0} is a square-integrable martingale (with respect 
to a specified filtration) with predictable quadratic variation (M n ) = {(M n )(t) : 
t > 0}, i.e., such that M%-(M n ) = {M n (t) 2 -(M n )(t) : t > 0} is a martingale by 
the Doob-Meyer decomposition. If the sequence of random variables {(M n )(T) : 
n > 1} is stochastically bounded in R for each T > 0, then the sequence of 
stochastic processes {M n : n > 1} is stochastically bounded in D. 

Proof. For e > given, apply the assumed stochastic boundedness of the 
sequence {(M n )(T) : n > 1} to obtain a constant d such that 



Then for that determined d, choose c such that d/c 2 < e/2. By the Lenglart- 
Rebolledo inequality (|81| . these two inequalities imply that 



5.2.4- FWLLN from Stochastic Boundedness 

We will want to apply stochastic boundedness in D to imply the desired fluid 
limit in Lemmas 14.21 and 1 4.31 The fluid limit corresponds to a functional weak 
law of large numbers (FWLLN). 

Lemma 5.9 (FWLLN from stochastic boundedness in D k ) Let {X n : n > 1} be 

a sequence of random elements of D k . Let {a n : n > 1} be a sequence of positive 
real numbers such that a n — > oo as n — > oo. If the sequence {X n : n > 1} is 
stochastically bounded in D k , then 



where r)(t) = (0, 0, . . . , 0), t > 0. 

Proof. As specified in Definition 1 5. A\ stochastic boundedness of the sequence 
{X n : n > 1} in D k corresponds to stochastic boundedness of the associated 
sequence {||X„||t : n > 1} in R for each T > 0. By Definition 15. 3[ stochas- 
tic boundedness in R is equivalent to tightness. It is easy to verify directly 
that we then have tightness (or, equivalently, stochastic boundedness) for the 
associated sequence {H^nHr/on : n > 1} in R. By Prohorov's theorem, Theo- 
rem 15.11 tightness on R (or any CSMS) is equivalent to relative compactness. 
Hence consider a convergent subsequence {|| X nk Hr/on* : fc > 1} of the se- 
quence {||A„||T/a„ : n > 1} in R: \\X nk \\T/a nk — > L as k — > oo. It suffices to 
show that P(L = 0) = 1; then all convergent subsequences will have the same 



P ((M n )(T) > d) < e/2 for all n > 1 . 




— — => rj in D k as n — > oo , 
a n 



(82) 
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limit, which implies convergence to that limit. For that purpose, consider the 
associated subsequence {H^nJIr : k > 1} in K. It too is tight. So by Prohorov's 
theorem again, it too is relatively compact. Thus there exists a convergent sub- 
subsequence: \\X nk \\t => L' in E. It follows immediately that 

\\X ni IIt 

^0 in R as I -> oo . (83) 

This can be regarded as a consequence of the generalized continuous map- 
ping theorem (GCMT), Theorem 3.4.4 of [53], which involves a sequence of 
continuous functions: Consider the functions /„ : R — > R defined by f n (x) = 
x/a n , n > 1, and the limiting zero function / : R — > R defined by f(x) = for 
all x. It is easy to see that f n (x n ) f( x ) = whenever x n — > x in R. Thus 
the GCMT implies the limit in (|83| . Consequently, the limit L we found for 
the subsequence {H^nJIx/ans. : k > 1} must actually be 0. Since that must be 
true for all convergent subsequences, we must have the claimed convergence in 

(EH). . 

6. Completing the Proof of Theorem 11.11 

In the next three subsections we complete the proof of Theorem 11.11 by the 
martingale argument, as outlined at the end of i )4.2l In ij6.ll we show that the 
required fluid limit in Lemma 14 . 31 follows from the stochastic boundedness of the 
the sequence of stochastic processes {X n : n > 1}. In $6.2\ we finish the proof 
of the fluid limit by proving that the associated predictable quadratic variation 
processes are stochastically bounded. Finally, in M6.3I we show how to remove 
the condition on the initial conditions. 



6. 1 . Fluid Limit from Stochastic Boundedness in D 

We now show how to apply stochastic boundedness to imply the desired fluid 
limit in Lemma 14.31 Here we simply apply Lemma 15.91 with a n = \fn for n > 1 
to our particular sequence of stochastic processes {X n : n > 1} in (|3j) or ([32]) . 

Lemma 6.1 (application to queueing processes) Let X n be the random ele- 
ments of D defined in Q or (|32j) . If the sequence {X n : n > 1} is stochastically 
bounded in D , then 

||n _1 g„ - w\\ T =>■ as n -> oo for all T>0, (84) 

where u>(t) = 1, t > 0. Equivalently, 

sup {|(Q„(i)/n)-l|}=*0 as n -> oo for all T>0. (85) 

0<*<T 
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Proof. As a consequence of Lemma 15. 9( From the stochastic boundedness of 
{X n } in D, we will obtain 

— — => rj m L> as n — > oo , 
\fn 

where r\ is the zero function defined above. This limit is equivalent to (|84[) and 
(EH). - 

In i j4.2l we already observed that Lemma [431 implies the desired Lemma |4~21 
So we have completed the required proof. 

Recap: How this all goes together. The consequence of the last three sec- 
tions is as follows: In order to complete the martingale argument to establish the 
required fluid limit in Lemmas 14.21 and 14.31 it suffices to establish the stochastic 
boundedness of two sequences of random variables: {(M nj i)(T) : n > 1} and 
{(M n>2 )(T) : n > 1} for any T > 0. 

Here is an explanation: For each n > 1, the associated stochastic processes 
{(M ni i)(t) : t > 0} and {(M nj2 )(i) : t > 0} are the predictable quadratic 
variations (compensators) of the scaled martingales M n> \ and M„ 2 in ([29]) . 
(|5D|) and ([5^)1 . First, by Lemma [5.81 the stochastic boundedness of these two 
sequences of PQV random variables implies stochastic boundedness in D of the 
two sequences of scaled martingales {M n ^ : n > 1} and {M n ^ 2 ■ n > 1} in 
That, in turn, under condition ([3]), implies the stochastic boundedness in D of 
the sequence of scaled queue- length processes {X n : n > 1} in (|32|) and (|40| 
by Lemma [5.51 Finally, by Lemma [5.91 the stochastic boundedness of {X n } in 
D implies the required fluid limit in (f84|) . The fluid limit in (|84|) was just what 
was needed in Lemmas 14.21 and [4731 



6.2. Stochastic Boundedness of the Quadratic Variations 

We have observed at the end of the last section that it only remains to 
establish the stochastic boundedness of two sequences of PQV random variables: 
{(M nA )(t) : n > 1} and {(M„ )2 )(t) : n > 1} for any t > 0. For each n>l, the 
associated stochastic processes {(M n> i)(t) : t > 0} and {(M nj2 )(i) : t > 0} are 
the predictable quadratic variations (compensators) of the scaled martingales 
M n> i and M n; 2 in ([33|l . We note that the stochastic boundedness of {(M nj i)(t) : 
t > 0} is trivial because it is deterministic. 

Lemma 6.2 (stochastic boundedness of {(M n _2)} Under the assumptions in 
Theorems 11.11 and 13.41 the sequence {(M n ^}{t)} : n > 1}) is stochastically 
bounded for each t > 0, where 

(M n , 3 }(t) = ^ Q n {s)ds, t >0 . 
^ Jo 
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Proof. It suffices to apply the crude inequality in Lemma 13.31 

- / Q n {s)ds < ^(Q„(0) + A(Xnt)) . 
n Jo n 

By Lemma 15.41 it suffices to show that the two sequences {Q n (0)/n} and 
{A(Xnt)/n} are stochastically bounded. By condition {4]) and the CMT, we 
have the WLLN's 

Qn(0) - n ^ Qn(0) ^ 1 

— => as n — > oo so that =>■ 1 as n — > oo , 

n n 

but that convergence implies stochastic boundedness; see Corollarv l5.ll Turning 
to the other term, since A„ = n/i, we have 

A(X n t) 

> [it as n — > oo w.p.l 

n 

by the SLLN for Poisson processes. But convergence w.p.l implies convergence 
in distribution, which in turn implies stochastic boundedness; again apply Corol- 
lary [5H] Hence we have completed the proof. ■ 



6.3. The Initial Conditions 

The results in ?i|3T l6.1l have used the finite moment condition E[Q n (Q)] < oo, 
but we want to establish Theorem 1 1 . 1 1 without this condition. In particular, note 
that this moment condition appears prominently in Lemmas 13.41 and 13.51 and in 
the resulting final martingale representations for the scaled processes, e.g., as 
stated in Theorems 13.41 and 13.51 

However, for the desired Theorem I l.li we do not need to directly impose this 
moment condition. We do need the assumed convergence of the scaled initial 
conditions in ([4]), but we can circumvent this moment condition by defining 
bounded initial conditions that converge to the same limit. For that purpose, 
we can work with 

Qn(0) = Q„(0) A2n, n>0, 

and 

Y (r .s _ Qn{0) - Tl 

a„ = = , n > 1 . 

y/n 

Then, for each n > 1, we use Q n (0) and X n (0) instead of Q n (0) and X n (0). 
We then obtain Theorem 1 1.1 1 for these modified processes: X n X in D as 
n — > oo. However, P(X n ^ X n ) ^0 as n — * oo. Hence, we have X n X as 
well. 



6.4- Limit from the Fourth Martingale Representation 



In this section we state the FCLT limit for the general G/GI/oo queue stem- 
ming from the fourth martingale representation in £)3.7l but we omit proofs and 
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simply refer to Krichagina and Puhalskii [37 . (We do not call the FCLT a dif- 
fusion limit because the limit is not a diffusion process.) Even in the M/M/oo 
special case, the limit process has a different representation from the represen- 
tation in Theorcm ll.il We will show that the two representations are actually 
equivalent. 

Theorem 6.1 (FCLT from the fourth martingale representation) Let X n be 
defined in ([53} and let A n be defined in (46]) and (gt]). If X n (0) X(0) in 
R and A n =>• Z in D, where Z is a process with continuous sample paths and 
Z(0) = 0, then X n =>• X in D, where 

X(t) = F£(t)X{0) + ^/qJtijw°(F (t)) + M x {t) - M 2 (t), t > 0, (86) 

where W° = {W°(t) : < t < 1} is a Brownian bridge, U = {U(t, x),t>0,0< 
x < 1} is a Kiefer process, X(0), W° , Z and U are independent, 

Mi{t) = [ F c (t - s)dZ(s), t > 0, 
Jo 

and ^ ^ 

M 2 [t)= I I l{s + x<t)dU(a(s),F(x)), t>0. 
Jo Jo 

For M/M/oo queues, Z — ^fflB , where B is a standard Brownian motion 
and the limit process is 

X(t) = e-^X(0) + ^0jw°(l-e-^) + ^JI f e-^-^dBis) 

Jo 




t 

lis + xKtfdUins^-e-^), t>0. (87) 



Remark 6.1 A corresponding limit holds for the two-parameter processes X n = 
{X n (t, y)} in Corollaru \3.1\ by a minor variation of the same argument. 

Connection to Theorem II. 11 We now show that the two characterizations 
of the limit X in (f87|) and (|5|) are equivalent for the M/M/oo special case. For 
that purpose, express the process X in (|87| as X = Z\ + Z 2 + Z% + Z4, where 

Z 3 (t) = V» I e-^- s UB{s), 
Jo 




l(s + x < t)dU(ns, 1 - e-^ x ), t>0. 



Clearly, Z\ (t) can be written as 

Z 1 (t)=-(j, [ Z!(s)ds + X(0). (88) 
Jo 
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By the solution to the linear SDE, §5.6 in [33], we have 



Z 3 (t) = -/i f Z 3 (s)ds + ^lB(t), (89) 
Jo 

where B is again standard Brownian motion. 

Recall - see §5.6.B, [33J - that the Brownian bridge W° is the unique strong 
solution to the one-dimensional SDE 

Y(t) 

dY(t) = -j±-jdt + dB 2 (t), Y(0) = 0, 0<t<l, 

where B 2 is second independent standard Brownian motion. So we can write 

f x W° (n) 

W°(x)=- / "-L^tidy + B 2 (x) (90) 
1-y 



and it follows that 



, / fl-e W°(ll) \ 

W) = V^(0)(-/ n -^dy + B^l-e^)) 



o 

t 



= ^/W)( - V j W°{1 - e-» s )ds + B 2 {1 - e""*)) 

= -fi f Z 2 (s)ds + v ^(0)B 2 (l- e ^ t ). (91) 
Jo 

Paralleling (|90p . the Kiefer process U is related to the Brownian sheet W = 
{W(t,x) : t > 0,0 < x < 1} by 

U(t,x) = - [ U } t,V ^ dy + W(t, x), t>0, 0<x<l. (92) 
Jo 1 -V 

Hence, for t > 0, x > 0, we have 

U(ut, 1 - e-" x ) = - f U ^ yS) dy + Wffjct, 1 - e^ z ) 

Jo 1 - V 

= -fi [ U(fit, 1 - e'^dy + W{jit, 1 - e^ x ). 
Jo 

Next, by similar reasoning, it can be shown that 

Zi{t) = -fj,f Z 4 (s)ds+ [ [ l(s + x < t)dW((J,8,l - e-» x ). (93) 
Jo Jo Jo 

By (JHSJ), ijgSJ), dH]) and ((53J, we have 



= X ~ ^ X{s)ds + ^iB{t) + ^q^)B 2 {l~e-^) 
Jo 

+ / / l(s + ar < t)dW[ixs, 1 - e^ x ), (94) 
Jo Jo 



G. Pang, R. Talreja and W. Whitt/ Martingale Proofs 



244 



where B and Bi are independent standard Brownian motions. Let B be the 
sum of the last three components in i.e., 

B(t) = y/J!B(t) + V?(0)£ 2 (l-e-^)+ / / l(s + x < t)dW{^s,l-e-^ x ). 

Jo Jo 

It is evident that the process B is a continuous Gaussian process with mean 
and, for s < t, 

E[B(t) - B{s)f = 2/i(t - s) - (1 - g(0))(e-"« - e^ 4 ), 

and 

E[B(t)B(s)\ = 2 M ,s - (1 - ? (0))(1 - e ^ s ). 

However, in the M/M/oo case, we have q(0) = 1, so that E[B(t)B(s)] = 2/t(sA 
t), which implies that the process {B(t) : t > 0} is distributed the same as 
{^2jlB(t) : t > 0}, where B is a standard Brownian motion. Therefore, we have 
shown that the M/M/oo case of Theorem 16. II is consistent with Theorem ll.il 

7. Other Models 

In this section we discuss how to treat other models closely related to our 
initial M/M/oo model. We consider the Erlang- A model in ^7.11 we also consider 
limits for the waiting time there. We consider finite waiting rooms in $7.'2{ 
Finally, we indicate how to treat general non-Poisson arrival processes in ^7.31 

7.1. Erlang A Model 

In this section we prove the corresponding many-server heavy-traffic limit 
for the M/M/n/oo + M (or Erlang- A or Palm) model in the QED regime. As 
before, the arrival rate is A and the individual service rate is \i. Now there are n 
servers and unlimited waiting room with the FCFS service discipline. Customer 
times to abandon are i.i.d. exponential random variables with a mean of 1/9. 
Thus individual customers waiting in queue abandon at a constant rate 8. The 
Erlang-C model arises when there is no abandonment, which occurs when 8 = 0. 
The Erlang C model is covered as a special case of the result below. 

Let Q(t) denote the number of customers in the system at time i, either 
waiting or being served. It is well known that the stochastic process Q = {Q{t) : 
t > 0} is a birth-and-death stochastic process with constant birth rate Afc = A 
and state-dependent death rate Hk — (k A n)n + (fc — n) + 8, k > 0, where 
a Ab = min {a, 6}, a V b = max {a, b} and (a) + = a V for real numbers a and b. 

As in Theorem ll.l[ the many-server heavy-traffic limit involves a sequence 
of Erlang- A queueing models. As before, we let this sequence be indexed by n, 
but now this n coincides with the number of servers. Thus now we are letting 
the number of servers be finite, but then letting that number grow. At the same 
time, we let the arrival rate increase. As before, we let the arrival rate in model 
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n be A„, but now we stipulate that A„ grows with n. At the same time, we hold 
the individual service rate \x and abandonment rate 9 fixed. Let p„ = X n /nfi be 
the traffic intensity in model n. We stipulate that 

(1 — p n )\fn — ► /3 as n — > oo , (95) 

where p a (finite) real number. That is equivalent to assuming that 

n/j, - A„ 



Pfi as n — > oo , (96) 



as in (|7|). Conditions ([93]) and are known to characterize the QED many- 
server heavy-traffic regime; see Halfin and Whitt [55] and Puhalskii and Reiman 
[53] , The many-server heavy-traffic limit theorem for the Erlang-A model was 
proved by Garnett, Mandelbaum and Reiman [2T], exploiting Stone [52]. See 
Zeltyn and Mandelbaum [TJ] and Mandelbaum and Zeltyn [33] for extensions 
and elaboration. For related results for single-server models with customer aban- 
donment, see Ward and Glynn [6"51 l6"rJl IB"?] . 

Here is the QED many-server heavy-traffic limit theorem for the M/M/n/oo+ 
M model: 

Theorem 7.1 (heavy-traffic limit in D for the M/M/n + M model) Consider 
the sequence of M/M/n/oo + M models defined above, with the scaling in (I95[) . 
Let X n be as defined in ([3]). If X n (0) =*> X(0) in M as n — > oo, then X n =>■ X in 
D as n — ► oo, where X is the diffusion process with infinitesimal mean m(x) = 
—f3[i — \xx for x < and m(x) = —ftp — Ox for x > 0, and infinitesimal variance 
<y 2 {x) = 2p. Alternatively, the limit process X satisfies the stochastic integral 
equation 

X(t) = X(0) - nf3t + a/2/LB(*) - f [n(X(s) A 0) + 9(X(s) V 0)] ds 

Jo 

for t > 0, where B is a standard Brownian motion. Equivalently, X satisfies the 
SDE 



dX(t) = -fi/3 - fx(X(t) A 0)dt - 9(X{t) V Q)dt + y/2fidB(t), t > . 

Proof. In the rest of this section we very quickly summarize the proof; the 
argument mostly differs little from what we did before. Indeed, if we use the 
second martingale representation as in W2.2\ and 13.51 then there is very little 
difference. However, if we use the first martingale representation, as in ? ^2.1I 
and 13.41 then there is a difference, because now we want to use the optional 
stopping theorem for multiparameter random time changes, as in §§2.8 and 
6.2 of Ethier and Kurtz [19] . That approach follows Kurtz [40], which draws 
on Helms [5S]. That approach has been applied in §12 of Mandelbaum and 
Pats [IS] . To illustrate this alternate approach, we use the random time change 
approach here. 

Just as in i j2.ll we can construct the stochastic process Q in terms of rate-1 
Poisson processes. In addition to the two Poisson processes A and S introduced 
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before, now we have an extra rate-1 Poisson process R used to generate aban- 
donments. Instead of (|12[) . here we have representation 

Q(t) = Q(0) + A(Xt) -D(t)-L(t), t>0, 

t 



Q(0) + A(Xt) -S \u j (Q(s) A n) ds 
-r(o j (Q(s) - n) + ds) , ii)7i 



for t > 0, where D{t) is the number of departures (service completions) in 
the time interval [0, i], while L(t) is the number of customers lost because of 
abandonment in the time interval [0,t]. Since there are only n servers, the in- 
stantaneous overall service rate at time s is fi(Q(s) An), while the instantaneous 
overall abandonment rate (which is only from waiting customers, not yet in ser- 
vice) at time s is 9(Q(s) — n) + . 

Paralleling (|14j) . we have the martingale representation 

Q{t) = Q(0) + M^t) - M 2 (t) - M 3 (t) + At 

—fi j (Q(s) An)ds-0 j (Q(s)-n) + ds (98) 



for t > 0, where 



Mi(t) = A(At) - Ai, 

M 2 (t) = S J (Q(a) An)ds^ - fj, J {Q{s)An)ds, 

M 3 (t) = R ( e J o (Q( 3 ) - n ) + ~° J ~ n ) + ds (") 

for t > and the filtration is F = \Tt ■ t > 0} defined by 

T t = o (q(0), A(Xs), S U J (Q(u) A n) du^j , (100) 



R i° Jo ~ 1 ° ~ S ~ 1 



for t > 0, augmented by including all null sets. 

We now want to justify the claims in (|98 p - (|100[) . Just as before, we can apply 
Lemmas 13.11 and 13 .31 to justify this martingale representation, but now we need 
to replace Lemmas 13.21 and !3.4l bv corresponding lemmas involving the optional 
stopping theorem with multiparameter random time changes, as in §§2.8 and 
6.2 of [19]. We now sketch the approach: We start with the three-parameter 
filtration 



H = H(h,t 2 ,t 3 ) (101) 
= g (Q(0), A( Sl ), S(s 2 ),R{s 3 ) : < s x < t u < s 2 < t 2 , < s 3 < t 3 ) 
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augmented by adding all null sets. Next introduce the three nondecreasing non- 
negative stochastic processes 



h{t) = Xt, 



h{t) = fx / (Q(s)/\n)ds, 



I 3 (t) = 9 / (Q(s)-n)+ds, t> . 
Jo 

Then observe that the vector (Ii(t), I 2 (t), Is(t)) is an H-stopping time. (This is 
essentially by the same arguments as we used in Lemma EOl But here we directly 
gain control of the arrival process, because the event {I\ (t) < x{\ coincides with 
the requirement that {Xt < x\\, which ensures that we always have enough of 
the arrival process to construct Q(s), < s < t.) 

Moreover, since A, S and R are assumed to be independent rate-1 Poisson 
processes, the stochastic process 

M = (Mi,M 2 ,M 3 ) = {(Mi(si),M 2 (s2),M 3 (s 3 ) : «i > 0, s 2 > 0,s 3 > 0} 
where 

Mi(t) = A(t)-t, 
M 2 (t) = S(t)-t, 
M 3 (t) = R(t)-t, t>0, 

is an H-multiparamctcr martingale. As a consequence of the optional stopping 
theorem, Theorem 8.7 on p. 87 of [TU] . 

(Mi o I U M 2 o J 2 , M 3 o I 3 ) ee {Mxihit)), M 2 (I 2 (t)), M 3 (I 3 (t)) :t>0} 

is a martingale with respect to the filtration F ee {T t : t > 0} in (|100|) . because 

H(h(t),I 2 (t),I 3 (t)) 
= a (Q(0), A(si),S(s 2 ),R(s 3 ) : < 8l < h(t),0 < s 2 < J 2 (t),0 < s 3 < J 3 (t)) 

= <T[Q(Q),A(\s),s(fJL I [Q{u)An)du 



rLJ (Q(s) 



) - n)+ : < s < t } = T t for all t > . 



As in £|3.4[ we use the crude inequality in Lemma 13.31 to guarantee that the 
moment conditions are satisfied: 

E[Ij(t)] < oo and £[Mj(t)] < oo for j = 1,2,3. 

Just as before, we then consider the sequence of models indexed by n. Just 
as in ([28)1 - (|3"Tj) , we define associated scaled processes: 

M nil (t) = n- 1 / 2 [A(Ant) - A n i] , (102) 
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M n , 2 (t) = n- 1 ' 2 
M n , 3 (t) = n- 1 ' 2 



S ( A 1 / (Qn( s ) A n) ds ) — (j, / (Q„(s) A n) ds 



(Qn(s) - n) + ds 



R[0 (Q„(s) - n)+ ds 



for £ > 0. 

We thus obtain the following analog of Theorem 13.41 

Theorem 7.2 (first martingale representation for the scaled processes in the 
Erlang A model) If E[Q n (0)] < oc, then the scaled processes have the martingale 
representation 

(X n - [in)t 



X n (t) = X n (0) + M nA (t) - M n , 2 (t) - M nt3 {t) + 



\fJ.(X n (s) A 0) + 0X n {s) + ] ds, t>0 



(103) 



where M n ^ are the scaled martingales in (|102p . These processes M n ^ are square- 
integrable martingales with respect to the filtrations F n = {J- n .t ■ t > 0} defined 
by 



augmented by including all null sets. Their associated predictable quadratic vari- 
ations are 



a ( Q„(0), A(X n s), S^fiJ (Q„(u) A n) du 

R ( 6 ( (Q n {u) - n)+ du ] : < s < t ) , t > , (104) 



(M„,i)(t) = 


X n t 
n 




(M n , 2 )(t) = 


- [ (Q»(s)A 
n Jo 

6 /** 


n) ds, 


(M„, 3 )(t) = 


- / (Qn(s) ~ 

n Jo 


n) + ds, 



(105) 

where E[(M n ^) (t)] < oo for all i, t > and n > 1. 

The representation in (|103|) satisfies the Lipschitz condition in Theorem 14. II 
so that the integral representation is again a continuous mapping. In particular, 
here we have 

(A„ - fin)t 



X n {t) = X n (0) + M nA (t) - M n , 2 {t) - M n<3 (t) + 

+ f h{X n (s))ds , t > , 
Jo 

where h : R — > R is the function 

/i(s) = — /x(s A 0) — #(s) + , sel, 



(106) 



(107) 
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so that h is Lipschitz as required for Theorem l4.ll 

\h{si) - h{s 2 )\ <(/iV0)|si-s 2 | for all s u s 2 eR. 
Hence the proof can proceed exactly as before. Note that we have convergence 
(A„ - nn) 



as n — > oo 



for the deterministic term in (1106|) by virtue of the QED scaling assumption in 
©. 

The analog of Theorem l4.2l is the corresponding FCLT for three independent 
rate-1 Poisson processes, now including R as well as A and S. We now have 
three random-time-change processes: &A,n, &s.n and which here take the 

form: 



*A,n(«) = 


(M n , 1 )(t) = ^ > 
n 






*5,n(*) = 


(Mn.aXt) = £ Aq(«) 


A n) ds, 




$R,n(t) = 


(M n , 3 )(t) - - Aq(«) 


— n) + ds , 


(108) 


drawing upon ()l05(). Bv the 


same line of reasoning 


; as before, 


we obtain the 



deterministic limits 

®A,n(t) => Me, *S,n(*) A*e and $H,„(i) , (109) 

where, as before, e is the identity map e(t) = i, i > 0, and 77 is the zero 
function rj(t) = 0, t > 0. In particular, we again get the sequence {JT„ : n > 1} 
stochastically bounded in D by the reasoning in § ^5.2l and l6.2l Then, by Lemma 
15.91 and H6.ll we get the FWLLN corresponding to Lemma 14.31 Finally, from 
Lemma [4.31 we can prove (|109[) . just as we proved Lemma [4.2[ using analogs 
of the continuous map in (|71|) for the random time changes in (|108[) . The new 
functions for applications of the CMT are 

h 1 (x)(t) = fif (x(s)Al)ds and h 2 (x)(t) = 9 [ (x(s) - 1) + ds 
Jo Jo 

for t > 0. 

Paralleling (|73|) . here we have 



(MA,n,®A,n,Ms, n ,$S,n,M R , n ,<f> R , n ) (B u (J£, B 2 , fie, B 3 ,T)) in D 6 

as n — ► 00. Hence we can apply the CMT with composition just as before. 
Paralleling (174")) , here we obtain first 

(Ma,„ o ® A ,n, Ms.n ° $S,n, M R>n ° ®R,n) (B\ o fie, B 2 O jj,e, B 3 o 7]) 
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in D 3 



as n 



oo, and then 



{Ma,u ° $A,n - M S ,n ° $S,n - Mr,u ° $R,n) => (#1 o /j,e - B 2 o fie - B 3 o r)) 

in D. However, 



Finally, the CMT with the integral representation (|103[) and Theorem 14. 1 1 com- 
pletes the proof. Note that the limiting Brownian motion associated with R 
does not appear, because is asymptotically negligible. That is why the 

infinitesimal variance is the same as before. ■ 

7.2. Finite Waiting Rooms 

We can also obtain stochastic-process limits for the number of customers in 
the system in associated M/M/n/0 (Erlang-B), M/M fn/m n and M /M /n/m n + 
M models, which have finite waiting rooms. For the Erlang-i? model, there is 
no waiting room at all; for the other models there is a waiting room on size 
m n in model n, where m n is allowed to grow with n so that m n /y/n — > k > 
as 11 -> 00 as in j8}. The QED many-server heavy-traffic limit was stated as 
Theorem O 

The proof can be much the same as in Q7.1\ The idea is to introduce the finite 
waiting room via a reflection map, as in §§3.5, 5.2, 13.5, 14.2, 14.3 and 14.8 of 
Whitt [5S], corresponding to an upper barrier at k, but the reflection map here 
is more complicated than for single-server queues and networks of such queues, 
because it is not applied to a free process. We use an extension of Theorem l4.ll 
constructing a mapping from Dxl into D 2 , taking model data into the content 
function and the upper-barrier regulator function. 

Theorem 7.3 (a continuous integral representation with reflection) Consider 
the modified integral representation 



Jo 

where x(t) < k, h : M —> M satisfies h(0) — and is a Lipschitz function as in 
(|62p. and u is a nondecreasing nonnegative function in D such that (| 1 holds 
and 



The modified integral representation in (| 1 1 1[) has a unique solution (x, u), so that 
it constitutes a bonafide function /a) : D x K — > D x D mapping (y, b) into 
x = fi{y,b) and u = f2(y,b). In addition, the function (/i,/2) is continuous 
provided that the product topology is used for product spaces and the function 
space D (in both the domain and range) is endowed with either: (i) the topology 
of uniform convergence over bounded intervals or (ii) the Skorohod J\ topology. 
Moreover, if y is continuous, then so are x and u. 




(110) 




(111) 
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Proof. We only show the key step, for which we follow the argument in §3 of 
Mandelbaum and Pats [15] and §4 of Reed and Ward [S5] ; see these sources for 
additional details and references. The idea is to combine classical results for the 
conventional one-dimensional reflection map, as in §§5.2 and 13.5 of [69] with a 
modification of Theorem 4.1. Let (<f> K ,ip K ) be the one-sided reflection map with 
upper barrier at k, so that <fi K (y) = V — ^k(j/)> with (f> K (y) being the content 
function and ip K (y) being the nondecreasing regulator function; see §§5.2 and 
13.5 of [55] ■ We observe that the map in (|111|) can be expressed as x = <fi K (w) 
and u = ip K (w), where 



This lets us represent the desired map as the composition of the maps (<p K , ip K ) 
and £. The argument to treat £ is essentially the same as in the proof of Theorem 
14. 1[ but we need to make a slight adjustment; we could apply it directly if we 
had h : D — > D in Theorem 14.11 Recall that (f> K is Lipschitz continuous on 
D([0,t] for each t with the uniform norm, \\4> K (yi) - 4> K (y2)\\t < 2\\yi - 2/2 1 1 * , 
with modulus 2 independent of i. Hence, paralleling ([6"4"]l . we have 



for each t > 0. Hence we can apply GronwalPs inequality in Lemma 14.11 to 
establish (Lipschitz) continuity of the map £ on D([0,T]) x M. Combining this 
with the known (Lipschitz) continuity of the reflection map (</> K , ip K ), we have the 
desired continuity for the overall map in the uniform topology. We can extend 
to the J\ topology as in the proof of Theorem 14.11 ■ 

Now that we understand how we are treating the finite waiting rooms, the 
QED many-server heavy-traffic limit theorem is as stated in Theorem 11.21 This 
modification alters the limiting diffusion process in Theorem 17.11 only by the 
addition of a reflecting upper barrier at k for the sequence of models with 
waiting rooms of size m„, where k = for the Erlang-B model. When n = 0, X 
is a reflected OU (ROU) process. Properties of the ROU process are contained 
in Ward and Glynn [SB]. Proofs for the two cases n > and n — by other 
methods are contained in §4.5 of Whitt [72] and Theorem 4.1 of Srikant and 
Whitt [61]. General references on reflection maps are Lions and Sznitman [43] 
and Dupuis and Ishii [18] . 

Proof of Theorem 11.21 We briefly sketch the proof. Instead of ([97]) , here we 
have representation 




(112) 



II wi - io 2 ||t < \h - b 2 \ + 1 1 ?/i - 2/2 IU + 2c / ||iui - w 2 \\ s ds 




Qn(t) 



Q n (0) + A(X n t) 



D n (t) - L n (t) - U n (t), t>0 




(113) 
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for t > 0, where U n (t) is the number of arrivals in the time interval [0,t] when 
the system is full in model n, i.e., when Q n (t) = n + m n . In particular, 

U n (t)= f l {Qn{s)=n+mn} dA(X n s), t>0. (114) 



To connect to Theorem 17. 3[ it is significant that U n can also be represented as 
the unique nondecreasing nonnegative process such that Q n (t) < n-\- m n , (|113|) 
holds and 

/>OC 

^{Q n (t)<n}dU n (t)^0 . (115) 

We now construct a martingale representation, just as in (|98l) - (|102p . The 
following is the natural extension of Theorem 17.21 



Theorem 7.4 (first martingale representation for the scaled processes in the 
M / M / n/ m n + M model) If n < oo, then the scaled processes have the martingale 
representation 

X n (t) = X n (0) + M w ,i(t) - M n<2 (t) - M n . 3 (t) + (A " 

[fx{X n {s) A 0) + 9X n (s)+] ds - V n (t), t > , (116) 



M n> i are the scaled martingales in (|102[) and 

t>0, (117) 

for U n in (|113|) - (|115p . The scaled processes M n> i are square-integrable martin- 
gales with respect to the filtrations F n = \JF n ,t '■ t > 0} defined by 



F n ,t = & yQn(0),A(\ n s),S yfi J (Q n (u) A n) du j , 

R[0 [ (Qn(u) -n)+du ] : < s < i ] , t>0, (118) 



augmented by including all null sets. Their associated predictable quadratic vari- 
ations are 

(M nil )(t) - ^, 
n 



(M n , 2 >(t) = ^ / (Q„(s) An)ds, 
n Jo 

(M„ l3 )(t) = - [ (Qn(s)-n)+ds, i>0, (119) 
" Jo 

where E[(M n ^) (t)] < oo /or aZ/ i, t > and n > 1. 
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By combining Theorems 17.31 and \7 A\ we obtain the joint convergence 

(X n , V n ) => (X, U) in D 2 as n — > oo , 

for X n and V n in pi6|) and (|117p , where the vector (X, U) is characterized by 
© and ([TOj). That implies Theorem O stated in 

Remark 7.1 (correction) The argument in this section follows Whitt [72], but 
provides more detail. We note that the upper-barrier regulator processes are 
incorrectly expressed in formulas (5.2) and (5.8) of [72"] . 



7.3. General Non-Markovian Arrival Processes 



In this section, following §5 of Whitt [72], we show how to extend the many- 
server heavy-traffic limit from M/M/n/m n + M models to G/M/n/m n + M 
models, where the arrival processes are allowed to be general stochastic point 
processes satisfying a FCLT. They could be renewal processes (GI) or even more 
general arrival processes. The limit of the arrival-process FCLT need not have 
continuous sample paths. (As noted at the end of §4.3[ this separate argument 
is not needed if we do not use martingales.) 

Let A n = {A n (t) : t > 0} be the general arrival process in model n and let 

An(t)= Mt) ~ Xn \ t>0, (120) 

be the associated scaled arrival process. We assume that 

A n => A in D as n — > oo . (121) 

We also assume that, conditional on the entire arrival process, model n evolves 
as the Markovian queueing process with i.i.d. exponential service times and i.i.d. 
exponential times until abandonment. 
Thus, instead of Theorem 17.41 we have 

Theorem 7.5 (first martingale representation for the scaled processes in the 
G/M/n/m n + M model) Consider the family ofG/AI/n/m n + M models defined 
above, evolving as a Markovian queue conditional on the arrival process. Ifm n < 
oo, then the scaled processes have the martingale representation 

X n (t) = X n (0) + A n (t) - M n>2 (t) - M„, 3 (t) + " 1111)1 



In 

\+1 



[fi(X n (a) A 0) + 0X n (s)+] ds - V n (t), t > , (122) 

where A n is the scaled arrival process in (I120p . M n ,i are the scaled martingales in 
(H52D and V n (t) ee U n (t)/^/n, t > 0, for U n in (fTT^ ]) -([TT5 |) . The scaled processes 
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M„ t i are square-integrable martingales with respect to the filtrations F n = {J- n ,t ■ 
t > 0} defined by 

T n ,t = a\Q n (0),{A n (u):u>0},s(^^(Q n (u)An)du\ , 

R\6 (Q n {u) - n) + du^j :0<s<tj , t>0 7 (123) 

augmented by including all null sets. The associated predictable quadratic vari- 
ations of the two martingales are 

(M n . 2 )(t) = H / (Q n (s)An)ds, 
n Jo 

(M n . 3 )(t) = - f (Q n (s) - n)+ ds, t>0, (124) 
n Jo 

where E[(M n ^){f)\ < oo for all i, t > and n > 1. 

Here is the corresponding theorem for the G/M/n/m n + M model. 

Theorem 7.6 (heavy-traffic limit in D for the G/M/n/m n + M model) Con- 
sider the sequence ofG/M/n/m„ +M models defined above, with the scaling in 
(fT20|) , © and ©. Let X n be as defined in §3$. If 

X„(0)=>X(0) in R and A n => A in D as n — > oo , 

then X n =4> X in D as n — > oo, where the limit process X satisfies the 
stochastic integral equation 

X(t) = X(0)+A(t)- (3fit + ^/JIB(t) 

-[ [n(X(s) A 0) + 6(X(s) V 0)] ds - U{t) (125) 
Jo 

for t > with B = {B{t) : t > 0} being a standard Brownian motion and U 
being the unique nondecreasing nonnegative process in D such that X(t) < k for 
all t, (|125|) holds and 




l{x(t)<K}dU(t) = . 



Proof. We start with the FCLT for the arrival process assumed in ()121[) and 
(|120j) . We condition on possible realizations of the arrival process: For each 
n > 1, let ( n be a possible realization of the scaled arrival process A n and let ( 
be a possible realization of the limit process A. Let X^ n be the scaled process 
X n conditional on A n = £„, and let X^ be the limit process X conditional on 
A = (. 

Since X n and A n are random elements of D, these quantities X!^ 1 and X^ are 
well defined via regular conditional probabilities; e.g., see §8 of Chapter V, pp 
146-150, of Parthasarathy [51]. In particular, we can regard P(X n e -\A n = z) 
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as a probability measure on D for each z E D and we can regard P(X n E 
B\A n — z) as a measurable function of z in D for each Borel set B in £>, where 

P(A„ E B) = ( P(X n E B\A n = z) dP(A n = z) . 
Jb 

And similarly for the pair (X, A) . 

A minor modification of the previous proof of Theorem 11.21 establishes that 

X%"- X^ in D whenever £ n — > £ in £> ; 

i.e., for each continuous bounded real- valued function / on D, 

E{f(Xt)]^E{f(X<)} as n^oo (126) 

whenever — > £ in D. Now fix a continuous bounded real-valued function / 
and let 

MCn) = E[f(X^)\ and h(0 = E[f(Xi)}. (127) 

Since we have regular conditional probabilities, we can regard the functions h n 
and h as measurable functions from D to R (depending on /). 

We are now ready to apply the generalized continuous mapping theorem, The- 
orem 3.4.4 of Since h n and h are measurable functions such that h n (( n ) ~* 
h(() whenever £„ — * ( and since A n A in D, we have h n (A n ) h(A) as 
n oo. Since the function / used in (|126[) and (|127|) is bounded, these random 
variables are bounded. Hence convergence in distribution implies convergence 
of moments. Hence, for that function /, we have 

E[f(X n )} = E[h n {A n )} -» E[h(A)] = E[f(X)] as n -» oo . 

Since this convergence holds for all continuous bounded real-valued functions / 
on D, we have shown that X n => X , as claimed. ■ 

8. The Martingale FCLT 

We now turn to the martingale FCLT. For our queueing stochastic-process 
limits, it is of interest because it provides one way to prove the FCLT for a 
Poisson process in Theorem 14.21 and because we can base our entire proof of 
Theorem ll.ll on the martingale FCLT. However, the gain in the proof of Theorem 
II. H is not so great. 

We now state a version of the martingale FCLT for a sequence of local mar- 
tingales {M n : n > 1} in D k , based on Theorem 7.1 on p. 339 of Ethier and 
Kurtz [19], hereafter referred to as EK. Another important reference is Jacod 
and Shiryayev [30], hereafter referred to as JS. See Section VIII. 3 of JS for 
related results; see other sections of JS for generalizations. 

We will state a special case of Theorem 7.1 of EK in which the limit pro- 
cess is multi-dimensional Brownian motion. However, the framework always 
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produces limits with continuous sample paths and independent Gaussian in- 
crements. Most applications involve convergence to Brownian motion. Other 
situations are covered by JS, from which we see that proving convergence to 
discontinuous processes is more complicated. 

The key part of each condition below is the convergence of the quadratic 
covariation processes. Condition (i) involves the optional quadratic-covariation 
(square-bracket) processes [M n> i, M n j], while condition (ii) involves the pre- 
dictable quadratic-covariation (angle-bracket) processes (M n j, M n j). Recall from 
£13.21 that the square-bracket process is more general, being well defined for 
any local martingale (and thus any martingale), whereas the associated angle- 
bracket process is well defined only for any locally square-integrable martingale 
(and thus any square-integrable martingale). 

Thus the key conditions below are the assumed convergence of the quadratic- 
variation processes in conditions (|130|) and (|133|) . The other conditions (|129p . 
(I13ip and (|133p are technical regularity conditions. There is some variation 
in the literature concerning the extra technical regularity conditions; e.g., see 
Rebolledo [54] and JS. 

Let J be the maximum- jump function, defined for any x £ D and T > 

by 

J(x,T)= sup {\x(t)-x(t-)\ :0<t<T} . (128) 

Theorem 8.1 (multidimensional martingale FCLT) For n > 1, let M n = 

{M n: i, . . . , M ni k) be a local martingale in D with respect to a filtration F n = 
{J- n .t : t > 0} satisfying M„(0) = (0, . . . , 0). Let C = (c,j) be a kxk covariance 
matrix, i.e., a nonnegative- definite symmetric matrix of real numbers. 

Assume that one of the following two conditions holds: 

(i) The expected value of the maximum jump in M n is asymptotically negli- 
gible; i.e., for each T > 0, 

lim {E[J(M n ,T)}}=0 (129) 

n — >oo 

and, for each pair with 1 < i < k and I < j < k, and each t > 0, 

[M nii , M n j] (t) =*> Cj jt in R as n — > oo . (130) 



(ii) The local martingale M n is locally square-integrable, so that the pre- 
dictable quadratic- covariation processes (M nt i, M nj -) can be defined. The ex- 
pected value of the maximum jump in (M Ut i, M n j) and the maximum squared 
jump of M n are asymptotically negligible; i.e., for each T > and (i,j) with 
1 < i < k and 1 < j < k, 

lim {E [J {{M n , h M Hij ),T)}} = , (131) 

} = , (132) 



lim \ E J(M n ,T) 2 
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and 



(M nt i, M n j)(t) Cijt in R as n — > oo 



(133) 



for each t > and for each 



Conclusion: 



If indeed one of the the conditions (i) or (ii) above holds, then 



M n M in D 



.k 



as 



n 



oo 



where M is a k- dimensional (0, C)-Brownian motion, having mean vector 
and covariance matrix 



where, for a matrix A, A tr is the transpose. 

Of course, a common simple case arises when C is a diagonal matrix; then 
the k component marginal one-dimensional Brownian motions are independent. 
When C — I, the identity matrix, M is a standard fc-dimensional Brown- 
ian motion, with independent one-dimensional standard Brownian motions as 
marginals. 

At a high level, Theorem 18.11 savs that, under regularity conditions, conver- 
gence of martingales in D is implied by convergence of the associated quadratic 
covariation processes. At first glance, the result seems even stronger, because we 
need convergence of only the one-dimensional quadratic covariation processes 
for a single time argument. However, that is misleading, because the stronger 
weak convergence of these quadratic covariation processes in D k is actually 
equivalent to the weaker required convergence in R for each t,i,j in conditions 
(pOj) and (JT33J); see [73]. 

9. Applications of the Martingale FCLT 

In this section we make two applications of the preceding martingale FCLT. 
First, we apply it to prove the FCLT for the scaled Poisson process, Theorem 
14.21 Then we apply it to provide a third proof of Theorem ll.il In the same way 
we could obtain alternate proofs of Theorems 11.21 and 17.11 

9.1. Proof of the Poisson FCLT 

We now apply the martingale FCLT to prove the Poisson FCLT in Theorem 
14.21 To do so, it suffices to consider the one-dimensional version in D, since the 
Poisson processes are mutually independent. Let the martingales M n = Ma,u 
be as defined in ([55]) . i.e., 



E[M(t)] = (0, . . . , 0) and E[M(t)M(t) tr ] = Ct, t > , 



M n = M Atn (t) 



A{nt) - nt 



t > , 
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with their internal nitrations. The limits in (|129| and ()132ll hold, because 

J(M n , T)<-^= and J(M*T) < -, n > 1 . 
yjn n 

For each n > 1, M n is square integrable, the optional quadratic variation process 
is 

[M n ](t)= [M n ,M n ](t) = , t > , 

n 

and the predictable quadratic variation process is 

nt 

(M n )(t) = (M n , M n )(t) = — =t, t>0. 

n 

Hence both (|131|) and (|133[) hold trivially. By the SLLN for a Poisson process, 
A(nt)/n — > t w.p.l as n — > oo for each t > 0. Hence both conditions (i) and (ii) 
in Theorem 18. II are satisfied, with C = c\ \ = 1. ■ 

9. 2. Completing the Proof of Theorem \1.1\ 

The bulk of this paper has consisted of a proof of Theorem 11.11 based on the 
first martingale representation in Theorem 13.41 which in turn is based on the 
representation of the service-completion counting process as a random time 
change of a rate-1 Poisson process, as in (fT2|) . A second proof in $4. 31 established 
the fluid limit directly. 

In this subsection we present a third proof of Theorcm l 1 . 1 I bascd on the second 
martingale representation in Theorem 13. 51 which in turn is based on a random 
thinning of rate-1 Poisson processes, as in (|14p . This third proof also applies 
to the third martingale representation in £13.61 which is based on constructing 
martingales for counting processes associated with the birth-and-death process 
{Q(t) : t > 0} via its infinitesimal generator. 

Starting with the second martingale representation in Theorem 13.51 (or the 
third martingale representation in Subsection l3.6p , we cannot rely on the Poisson 
FCLT to obtain the required stochastic process limit 

(M n ,i,M ni2 ) =► (^/J1B 1 ,^/J1B 2 ) in D 2 as n -> oo , (134) 

in (|59D . However, we can apply the martingale FCLT for this purpose, and we 
show how to do that now. 

As in £19.11 we can apply either condition (i) or (ii) in Theorem 18. 11 but it is 
easier to apply (ii), so we will. The required argument looks more complicated 
because we have to establish the two-dimensional convergence in (|134p in D 2 
because the scaled martingales M n> i and M n> 2 are not independent. Fortunately, 
however, they are orthogonal, by virtue of the following lemma. That still means 
that we need to establish the two-dimensional limit in (|134|) . but it is not difficult 
to do so. 
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We say that two locally square-integrable martingales with respect to the 
filtration F, Mi and Mi, are orthogonal if the process M\M 2 is a local mar- 
tingale with Mi(0)M 2 (0) = 0. Since M X M 2 - (M X ,M 2 ) is a local martingale, 
orthogonality implies that (Mi, M 2 )(t) = for all t. 

Lemma 9.1 (orthogonality of stochastic integrals with respect to orthogonal 
martingales) Suppose that M\ and M2 are locally square-integrable martingales 
with respect to the filtration F, where Mi(0) = M2(0) = 0, while C\ and C 2 are 
locally-bounded F -predictable processes. If Mi and M 2 are orthogonal, (which 
is implied by independence), then the stochastic integrals J* C\(s) dMi(s) and 
J* C2(s) dM2(s) are orthogonal, which implies that 



and 



C x (s)dMi(s), / C 2 (s)dM 2 (s))(t)=0, t>0. 



(t) = 0, t > 



Ci(s)dMi(s), / C 2 (s)dM 2 (s 



Lemma 19.11 follows from the following representation for the quadratic covari- 
ation of the stochastic integrals; see §5.9 of van der Vaart [64] . 

Lemma 9.2 (Quadratic covariation of stochastic integrals with respect to mar- 
tingales) Suppose that Mi and M 2 are locally square-integrable martingales with 
respect to the filtration F, while C\ and C 2 are locally-bounded F-predictable 
processes. Then 



Ci(s)dMi(s) / C 2 (s)dM 2 (s) 
Jo 

Ci(s)dMi(s), I ' C 2 (s)dM 2 (s) 



(t) : t > 



and 



Ci(s)dMi(s) / C 2 (s)dM 2 (s) 
Jo 

Ci(s)dMi(s), j C 2 (s)dM 2 (s)\ (t):t>0 
are local F- martingales, where the quadratic covariation processes are 



Ci(s)dMi(s), / C 2 (s)dM 2 (s) 



(t) = / Ci(s)C 2 (s)d[Mi,M 2 }(s) 



and 



Ci(s)dMi(s), / C 2 (s)dM 2 (s))(t) 



Ci(s)C 2 (s)d(Mi,M 2 )(s), t>0 
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As a consequence of the orthogonality provided by Lemma 19. 1) we have 
[M nt i, M n ^2](t) = and (M Ut i, M n ^) it) = for all t and n for the martin- 
gales in (I134[) , which in turn come from Theorem 13.51 Thus the orthogonality 
trivially implies that 

[M n>1 ,M n>2 ] =>• and {M nA , M n>2 )(f) in R as n -> oo 

for alH > 0. We then have 

(M nj i, M nt i)(t) => Ci^t = /it in R as n — ► oo 

for each i and i = 1, 2 by (|41 [) in Theorem [33] and Lemma l4.21 as in the previous 
argument used in the first proof of Theorem II. II in ^4.1116.21 As stated above, 
the bulk of the proof is thus identical. By additional argument, we can also show 
that 

[M n> i, M n> i\{£) =>■ Ci^t — \it in R as n — > oo . 
starting from H2l) . 

We have just shown that (| 1 33|) holds. It thus remains to show the other con- 
ditions in Theorem EH] (ii) are satisfied. First, since we have a scaled unit-jump 
counting process, condition (|132p holds by virtue of the scaling in Theorem 
13.51 and ([30]) . Next (|13ip holds trivially because the predictable quadratic vari- 
ation processes (M n> i) and {M n ^) are continuous. Hence this third proof is 
complete. ■ 

In closing this section, we observe that this alternate method of proof also 
applies to the Erlang-^4 model in §7 .1\ and the generalization with finite waiting 
rooms in Theorem 11.21 
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APPENDIX 

Appendix A: Proof of Lemma 13.11 

Let A be the jump function, i.e., AX(t) = X(t) — X(t—) and let 



]T(AX( S )) 2 , t>0, 
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represent the sum of all squared jumps. Since (AI(s)) 2 > 0, the sum is indepen- 
dent of the order. Hence the sum is necessarily well defined, although possibly 
infinite. (For any positive integer n, a function in D has at most finitely many 
jumps with \x(t) — x(t— )| > 1/n over any bounded interval; see Lemma 1 on p. 
110 of Billingsley [8]. Hence, the sample paths of each stochastic process in D 
have only countably many jumps.) 

The optional quadratic variation has a very simple structure for locally- 
bounded- variation processes, i.e., processes with sample paths of bounded vari- 
ation over every bounded interval; see (18.1) on p. 27 of Rogers and Williams 



Lemma A.l (optional quadratic covariation and variation for a locally-bounded- 
variation process) // stochastic processes X and Y almost surely have sample 
paths of bounded variation over each bounded interval, then 

[X,Y}(t)=^2(^X( s ))(AY( s )), t>0, (135) 

s<t 

and 

[l](t)E[X,X](t) = ^(AX( s )) 2 , t>0. (136) 

s<t 

Lemma A. 2 (optional quadratic variation for a counting process) If N is a 
non- explosive unit- jump counting process with compensator A, both adapted to 
the filtration F, then N — A is a locally square-integrable martingale of locally 
bounded variation with square-bracket process 



[N-A](t) = ^(A(7V-A)( S )) 2 (137) 

s<t 

= N(t)-2 [ AA{s)dN{s)+ [ AA(s)dA(s), t>0. 
Jo Jo 

If, in addition, the compensator A is continuous, then 

[N — A] = N , (138) 

Proof. For (|137[) . we apply (I136|) in Lemma [A. 11 see §5.8 of van der Vaart 
[6"i] . For (|138|) , we observe that the last two terms in (|137|) become when A is 
continuous. ■ 



Proof of Lemma 13.11 We will directly construct the compensator of the 
submartingale {M(t) 2 : t > 0}. Since (i) TV is a non-explosive counting process, 
(ii) E[N(t)) < oo and i?[^4(i)] < oo for all t and (iii) N and A have nondecreasing 
sample paths, the sample paths of the martingale M = N — A provided by 
Theorem 13. H are of bounded variation over bounded intervals. Since iV(0) = 0, 
M(0) = 0. We can thus start by applying integration by parts, referred to as 
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the product formula on p. 336 of Bremaud [12], to write 
M{t) 2 = [ M(s-)dM(s)+ [ M(s)dM(s) 



Jo 

M(s-) dM{s) + / (M(s) - M(s-)) dM[s) 
io Jo 

= 2/ M(s-)dM(s) + [M](t) 







t 

M(s-)dM(s) + N(t), t>0, (139) 



The last step follows from (|138[) in Lemma [A. 21 

We now want to show that the stochastic integral J Q M(s-) dM(s) is a mar- 
tingale. To get the desired preservation of the martingale structure, we can apply 
the integration theorem, Theorem T6 on p. 10 of [12], but we need additional 
regularity conditions. At this point, we localize in order to obtain boundedness. 

To obtain such bounded martingales associated with N and M = N — A, let 
the stopping times be defined in the obvious way by 

T m = M{t > : \M(t)\ > m or A(t) > m} . (140) 

Then {r m < t} e T u t > 0. We now define the associated stopped processes: 
Let 

M m (t) = M(t A T m ), N m (t) = N{t Ar m ) and A m (t) = A(t A r m ) (141) 

for all t > and m > 1. Then N m (t) = M m (t) + A m (t) for t > and {M m (t) : 
t > 0} is a martingale with respect to {.Ft} having compensator {A m (t) : t > 0} 
for each m > 1, as claimed. Moreover, all three stochastic processes N m , M m 
and A m are bounded. The boundedness follows since N has unit jumps and A 
is continuous. 

We then obtain the representation for these stopped processes corresponding 
to (| 139(1 : in particular, 

M m {tf = 2 f M m {s-) dM m {s) + N m (t), t > , (142) 
Jo 

for each m > 1. With the extra boundedness provided by the stopping times, we 
can apply Theorem T6 on p. 10 of 12J to deduce that the integral 
J* M m (s—)dM m (s) is an F-martingale. First, M m (t) is a martingale of inte- 
grable bounded variation with respect to F, as defined on p. 10 of 12]. Moreover, 
the process {M m (t— ) : t > 0} is an F-predictable process such that 



\M m (s-)\d\M m \(s) < 



00 



11 



By Theorem T6 on p. 10 of [T2], the integral f* M m (s-)dM m (s) is an F- 
martingale. Thus {M m (t) 2 - N m {t) : t > 0} is an F-martingale. But {N m (t) ~ 
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A m (t) : t > 0} is also an F-martingale. Adding, we see that {M m (t) 2 - A m (t) : 
t > 0} is an F-martingale for each m. Thus, for each m, the predictable quadratic 
variation of M m is (M rn }(t) = A m (t), t > 0. 

Now we can let m t oo and apply Fatou's Lemma to get 



E[M(tf] = E lim M m (tf 



< lim inf E 

m — >oo 



M m {tf 



lim inf E 

rn — >oo 



A m {t) 



E 



A{t) 



< oo. 



Therefore, M itself is square integrable. We can now apply the monotone con- 
vergence theorem in the conditioning framework, as on p. 280 of [12j . to get 



E[M m (t- 



\T t ] E[M(t + sf\T t ] and E[A rn {t + s)\T t ] -> E[A(t + s)\F t 



aira^oo for each t > and s > 0. Then, since 

E[M m {t + sf - A rn {t + s)|jF t ] = M m {tf - A rn (t) for all m > 1 , 

M m (t) -> M(t) and A m (<) -> we have 

E[M(t + sf - A(t + s)\F t ] = M{t) 2 - A(t) 

as well, so that M 2 — A is indeed a martingale. Of course that implies that 
(M) = A, as claimed. We get [M] — N from Lemma IA.2| as noted at the 
beginning of the proof. ■ 

We remark that there is a parallel to Lemma IA.2I for the angle-bracket pro- 
cess, applying to cases in which the compensator is not continuous. In contrast 
to Lemma [3Tl we now do not assume that E[N(t)] < oo, so we need to localize. 



Lemma A. 3 (predictable quadratic variation for a counting process) If N is a 
non- explosive unit- jump counting process with compensator A, both adapted to 
the filtration F, then N — A is a locally square-integrable martingale of locally 
bounded variation with angle-bracket process 

(N-A)(t) = {[N-A])(t) (143) 

= A(t)- [ AA(s)dA(s)= [ (l-AA(s))dA{s), i>0. 
Jo Jo 

If in addition, the compensator A is continuous, then 

(N - A) = A . (144) 



Proof. For pl3]) . we exploit the fact that (N — A) = ([N - A}); see p. 377 of 
Rogers and Williams [55] and §5.8 of van der Vaart [53]. The third term on the 
right in (| 1 3T[) is predictable and thus its own compensator. The compensators 
of the first two terms in (| 13T[) are obtained by replacing N by its compensator 
A. See Problem 3 on p. 60 of Liptser and Shiryayev [44]. ■ 



